Dataset statistics
| Statistic | Value |
|---|---|
| Number of variables | 69 |
| Number of observations | 42497 |
| Missing cells | 838314 |
| Missing cells (%) | 28.6% |
| Duplicate rows | 1364 |
| Duplicate rows (%) | 3.2% |
| Total size in memory | 19.3 MiB |
| Average record size in memory | 475.0 B |
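The two percentages in this table follow from the raw counts. A minimal sketch of that arithmetic, assuming "Missing cells (%)" is taken over all cells (variables × observations) and "Duplicate rows (%)" over all observations; the variable names are illustrative only:

```python
# Headline figures copied from the "Dataset statistics" table.
n_variables = 69
n_observations = 42_497
missing_cells = 838_314
duplicate_rows = 1_364

# Assumption: the missing-cell percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: the duplicate percentage is relative to the number of rows.
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Total cells:        {total_cells}")
print(f"Missing cells (%):  {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
```

Under these assumptions the printed percentages agree with the 28.6% and 3.2% reported in the table.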
Variable types
| Type | Count |
|---|---|
| Categorical | 45 |
| Numeric | 9 |
| Boolean | 11 |
| Unsupported | 4 |
Alerts
Dataset has 1364 (3.2%) duplicate rows | Duplicates |
Ref_publication_date has a high cardinality: 1890 distinct values | High cardinality |
Substrate_stack_sequence has a high cardinality: 194 distinct values | High cardinality |
ETL_stack_sequence has a high cardinality: 1444 distinct values | High cardinality |
ETL_thickness has a high cardinality: 1419 distinct values | High cardinality |
ETL_additives_compounds has a high cardinality: 447 distinct values | High cardinality |
ETL_deposition_procedure has a high cardinality: 368 distinct values | High cardinality |
Perovskite_composition_a_ions has a high cardinality: 264 distinct values | High cardinality |
Perovskite_composition_a_ions_coefficients has a high cardinality: 565 distinct values | High cardinality |
Perovskite_composition_b_ions has a high cardinality: 53 distinct values | High cardinality |
Perovskite_composition_b_ions_coefficients has a high cardinality: 148 distinct values | High cardinality |
Perovskite_composition_c_ions_coefficients has a high cardinality: 370 distinct values | High cardinality |
Perovskite_additives_compounds has a high cardinality: 979 distinct values | High cardinality |
Perovskite_additives_concentrations has a high cardinality: 706 distinct values | High cardinality |
Perovskite_deposition_procedure has a high cardinality: 198 distinct values | High cardinality |
Perovskite_deposition_synthesis_atmosphere has a high cardinality: 186 distinct values | High cardinality |
Perovskite_deposition_solvents has a high cardinality: 343 distinct values | High cardinality |
Perovskite_deposition_solvents_mixing_ratios has a high cardinality: 528 distinct values | High cardinality |
Perovskite_deposition_quenching_media has a high cardinality: 94 distinct values | High cardinality |
Perovskite_deposition_quenching_media_additives_compounds has a high cardinality: 84 distinct values | High cardinality |
Perovskite_deposition_thermal_annealing_temperature has a high cardinality: 870 distinct values | High cardinality |
Perovskite_deposition_thermal_annealing_time has a high cardinality: 766 distinct values | High cardinality |
Perovskite_deposition_after_treatment_of_formed_perovskite has a high cardinality: 103 distinct values | High cardinality |
HTL_stack_sequence has a high cardinality: 1959 distinct values | High cardinality |
HTL_thickness_list has a high cardinality: 439 distinct values | High cardinality |
HTL_additives_compounds has a high cardinality: 380 distinct values | High cardinality |
HTL_additives_concentrations has a high cardinality: 223 distinct values | High cardinality |
HTL_deposition_procedure has a high cardinality: 120 distinct values | High cardinality |
HTL_deposition_solvents has a high cardinality: 51 distinct values | High cardinality |
Backcontact_stack_sequence has a high cardinality: 289 distinct values | High cardinality |
Backcontact_thickness_list has a high cardinality: 384 distinct values | High cardinality |
Backcontact_deposition_procedure has a high cardinality: 113 distinct values | High cardinality |
Encapsulation_stack_sequence has a high cardinality: 118 distinct values | High cardinality |
Cell_area_measured is highly overall correlated with Module_area_total and 8 other fields | High correlation |
Module_area_total is highly overall correlated with Cell_area_measured and 7 other fields | High correlation |
Perovskite_deposition_number_of_deposition_steps is highly overall correlated with JV_scan_integration_time | High correlation |
JV_light_intensity is highly overall correlated with ETL_surface_treatment_before_next_deposition_step and 4 other fields | High correlation |
JV_scan_speed is highly overall correlated with ETL_surface_treatment_before_next_deposition_step and 6 other fields | High correlation |
JV_scan_delay_time is highly overall correlated with Cell_area_measured and 13 other fields | High correlation |
JV_preconditioning_time is highly overall correlated with Module_area_total and 18 other fields | High correlation |
JV_preconditioning_potential is highly overall correlated with Cell_flexible and 11 other fields | High correlation |
JV_default_PCE is highly overall correlated with Perovskite_surface_treatment_before_next_deposition_step | High correlation |
Cell_architecture is highly overall correlated with JV_scan_integration_time | High correlation |
Cell_flexible is highly overall correlated with JV_preconditioning_time and 4 other fields | High correlation |
Module is highly overall correlated with JV_scan_delay_time and 3 other fields | High correlation |
ETL_surface_treatment_before_next_deposition_step is highly overall correlated with Cell_area_measured and 19 other fields | High correlation |
Perovskite_dimension_2D is highly overall correlated with Module_area_total and 9 other fields | High correlation |
Perovskite_dimension_3D is highly overall correlated with JV_scan_delay_time and 5 other fields | High correlation |
Perovskite_dimension_3D_with_2D_capping_layer is highly overall correlated with JV_scan_delay_time and 8 other fields | High correlation |
Perovskite_composition_perovskite_ABC3_structure is highly overall correlated with JV_scan_delay_time and 7 other fields | High correlation |
Perovskite_composition_b_ions is highly overall correlated with JV_preconditioning_time and 6 other fields | High correlation |
Perovskite_composition_c_ions is highly overall correlated with Perovskite_dimension_3D_with_2D_capping_layer and 1 other field | High correlation |
Perovskite_composition_none_stoichiometry_components_in_excess is highly overall correlated with Cell_area_measured and 5 other fields | High correlation |
Perovskite_band_gap_graded is highly overall correlated with Module_area_total and 8 other fields | High correlation |
Perovskite_deposition_quenching_induced_crystallisation is highly overall correlated with JV_preconditioning_time and 4 other fields | High correlation |
Perovskite_deposition_quenching_media is highly overall correlated with JV_preconditioning_time and 4 other fields | High correlation |
Perovskite_deposition_quenching_media_additives_compounds is highly overall correlated with Cell_area_measured and 17 other fields | High correlation |
Perovskite_deposition_thermal_annealing_atmosphere is highly overall correlated with ETL_surface_treatment_before_next_deposition_step and 1 other field | High correlation |
Perovskite_deposition_solvent_annealing is highly overall correlated with JV_preconditioning_time and 3 other fields | High correlation |
Perovskite_deposition_solvent_annealing_solvent_atmosphere is highly overall correlated with JV_scan_delay_time and 5 other fields | High correlation |
Perovskite_surface_treatment_before_next_deposition_step is highly overall correlated with Cell_area_measured and 18 other fields | High correlation |
HTL_deposition_synthesis_atmosphere is highly overall correlated with ETL_surface_treatment_before_next_deposition_step and 2 other fields | High correlation |
HTL_deposition_solvents is highly overall correlated with Perovskite_surface_treatment_before_next_deposition_step and 4 other fields | High correlation |
HTL_deposition_solvents_mixing_ratios is highly overall correlated with Cell_area_measured and 13 other fields | High correlation |
Add_lay_front_stack_sequence is highly overall correlated with JV_scan_delay_time and 7 other fields | High correlation |
Add_lay_back is highly overall correlated with JV_scan_delay_time and 9 other fields | High correlation |
Encapsulation is highly overall correlated with JV_preconditioning_time and 4 other fields | High correlation |
JV_light_spectra is highly overall correlated with JV_preconditioning_time and 7 other fields | High correlation |
JV_scan_integration_time is highly overall correlated with Cell_area_measured and 25 other fields | High correlation |
JV_preconditioning_protocol is highly overall correlated with Cell_area_measured and 12 other fields | High correlation |
Cell_architecture is highly imbalanced (68.0%) | Imbalance |
Cell_flexible is highly imbalanced (83.0%) | Imbalance |
Module is highly imbalanced (93.2%) | Imbalance |
Substrate_stack_sequence is highly imbalanced (83.1%) | Imbalance |
ETL_stack_sequence is highly imbalanced (51.9%) | Imbalance |
ETL_additives_compounds is highly imbalanced (84.1%) | Imbalance |
ETL_deposition_procedure is highly imbalanced (56.2%) | Imbalance |
Perovskite_dimension_2D is highly imbalanced (83.9%) | Imbalance |
Perovskite_dimension_3D is highly imbalanced (79.7%) | Imbalance |
Perovskite_dimension_3D_with_2D_capping_layer is highly imbalanced (95.9%) | Imbalance |
Perovskite_composition_perovskite_ABC3_structure is highly imbalanced (86.7%) | Imbalance |
Perovskite_composition_a_ions is highly imbalanced (75.3%) | Imbalance |
Perovskite_composition_a_ions_coefficients is highly imbalanced (74.1%) | Imbalance |
Perovskite_composition_b_ions is highly imbalanced (92.3%) | Imbalance |
Perovskite_composition_b_ions_coefficients is highly imbalanced (91.3%) | Imbalance |
Perovskite_composition_c_ions is highly imbalanced (80.0%) | Imbalance |
Perovskite_composition_c_ions_coefficients is highly imbalanced (75.1%) | Imbalance |
Perovskite_composition_none_stoichiometry_components_in_excess is highly imbalanced (54.4%) | Imbalance |
Perovskite_band_gap_graded is highly imbalanced (98.5%) | Imbalance |
Perovskite_deposition_procedure is highly imbalanced (73.7%) | Imbalance |
Perovskite_deposition_synthesis_atmosphere is highly imbalanced (63.3%) | Imbalance |
Perovskite_deposition_solvents is highly imbalanced (58.6%) | Imbalance |
Perovskite_deposition_solvents_mixing_ratios is highly imbalanced (58.6%) | Imbalance |
Perovskite_deposition_quenching_media is highly imbalanced (67.1%) | Imbalance |
Perovskite_deposition_thermal_annealing_atmosphere is highly imbalanced (94.2%) | Imbalance |
Perovskite_deposition_solvent_annealing is highly imbalanced (85.3%) | Imbalance |
Perovskite_deposition_solvent_annealing_solvent_atmosphere is highly imbalanced (96.7%) | Imbalance |
Perovskite_deposition_after_treatment_of_formed_perovskite is highly imbalanced (56.2%) | Imbalance |
HTL_stack_sequence is highly imbalanced (64.7%) | Imbalance |
HTL_additives_compounds is highly imbalanced (73.8%) | Imbalance |
HTL_deposition_procedure is highly imbalanced (82.9%) | Imbalance |
HTL_deposition_synthesis_atmosphere is highly imbalanced (94.6%) | Imbalance |
HTL_deposition_solvents is highly imbalanced (94.0%) | Imbalance |
HTL_deposition_solvents_mixing_ratios is highly imbalanced (66.1%) | Imbalance |
Backcontact_stack_sequence is highly imbalanced (71.6%) | Imbalance |
Backcontact_thickness_list is highly imbalanced (60.2%) | Imbalance |
Backcontact_deposition_procedure is highly imbalanced (84.7%) | Imbalance |
Add_lay_front_stack_sequence is highly imbalanced (99.1%) | Imbalance |
Add_lay_back is highly imbalanced (99.2%) | Imbalance |
Encapsulation is highly imbalanced (75.3%) | Imbalance |
Encapsulation_stack_sequence is highly imbalanced (95.2%) | Imbalance |
JV_light_spectra is highly imbalanced (98.6%) | Imbalance |
JV_preconditioning_protocol is highly imbalanced (55.6%) | Imbalance |
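The report does not say how the imbalance percentage is defined. One reading that reproduces the figures for Cell_flexible (83.0%), Module (93.2%) and Cell_architecture (68.0%) is one minus the Shannon entropy of the class frequencies, normalised by the maximum entropy for that number of classes. The sketch below is written under that assumption; `imbalance_score` is a hypothetical helper, not a function of the profiling library, and the counts are taken from the per-variable Common Values tables of this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k): H is the Shannon entropy (in bits) of the
    class frequencies and k is the number of classes."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single class is reported as 0 imbalance
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Class counts copied from the per-variable tables of this report.
cell_flexible = [41425, 1072]                        # False, True
module = [42151, 346]                                # False, True
cell_architecture = [29667, 12776, 33, 8, 7, 5, 1]   # nip, pin, ...

for name, counts in [("Cell_flexible", cell_flexible),
                     ("Module", module),
                     ("Cell_architecture", cell_architecture)]:
    print(f"{name}: {100 * imbalance_score(counts):.1f}%")
```

A score of 0% corresponds to perfectly balanced classes and a score near 100% to a column dominated by a single value, which is why nearly constant flags such as Add_lay_back score above 99%.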
Cell_area_measured has 711 (1.7%) missing values | Missing |
Module_area_total has 42165 (99.2%) missing values | Missing |
ETL_thickness has 23933 (56.3%) missing values | Missing |
ETL_additives_compounds has 3293 (7.7%) missing values | Missing |
ETL_surface_treatment_before_next_deposition_step has 42330 (99.6%) missing values | Missing |
Perovskite_composition_none_stoichiometry_components_in_excess has 35521 (83.6%) missing values | Missing |
Perovskite_additives_compounds has 28675 (67.5%) missing values | Missing |
Perovskite_additives_concentrations has 37606 (88.5%) missing values | Missing |
Perovskite_thickness has 29124 (68.5%) missing values | Missing |
Perovskite_band_gap has 10585 (24.9%) missing values | Missing |
Perovskite_pl_max has 32254 (75.9%) missing values | Missing |
Perovskite_deposition_solvents_mixing_ratios has 2125 (5.0%) missing values | Missing |
Perovskite_deposition_quenching_media_mixing_ratios has 41781 (98.3%) missing values | Missing |
Perovskite_deposition_quenching_media_additives_compounds has 41618 (97.9%) missing values | Missing |
Perovskite_deposition_after_treatment_of_formed_perovskite has 40873 (96.2%) missing values | Missing |
Perovskite_surface_treatment_before_next_deposition_step has 42493 (> 99.9%) missing values | Missing |
HTL_thickness_list has 32425 (76.3%) missing values | Missing |
HTL_additives_compounds has 18371 (43.2%) missing values | Missing |
HTL_additives_concentrations has 41462 (97.6%) missing values | Missing |
HTL_deposition_solvents_mixing_ratios has 41988 (98.8%) missing values | Missing |
Backcontact_thickness_list has 7976 (18.8%) missing values | Missing |
JV_light_spectra has 2491 (5.9%) missing values | Missing |
JV_scan_speed has 26362 (62.0%) missing values | Missing |
JV_scan_delay_time has 42370 (99.7%) missing values | Missing |
JV_scan_integration_time has 42477 (> 99.9%) missing values | Missing |
JV_preconditioning_protocol has 41497 (97.6%) missing values | Missing |
JV_preconditioning_time has 42279 (99.5%) missing values | Missing |
JV_preconditioning_potential has 42362 (99.7%) missing values | Missing |
JV_default_PCE has 925 (2.2%) missing values | Missing |
Cell_area_measured is highly skewed (γ1 = 193.1041949) | Skewed |
JV_light_intensity is highly skewed (γ1 = 71.51583535) | Skewed |
JV_scan_speed is highly skewed (γ1 = 122.3392709) | Skewed |
Perovskite_surface_treatment_before_next_deposition_step is uniformly distributed | Uniform |
Perovskite_thickness is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Perovskite_band_gap is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Perovskite_pl_max is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Perovskite_deposition_quenching_media_mixing_ratios is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
| Setting | Value |
|---|---|
| Analysis started | 2023-05-05 12:07:26.182989 |
| Analysis finished | 2023-05-05 12:07:57.994062 |
| Duration | 31.81 seconds |
| Software version | ydata-profiling v4.1.2 |
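The reported duration can be checked against the two timestamps. A small sketch using only the standard library; the format string is an assumption about how the timestamps were written:

```python
from datetime import datetime

# Assumed timestamp layout: date, time, and microseconds.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-05-05 12:07:26.182989", FMT)
finished = datetime.strptime("2023-05-05 12:07:57.994062", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```

The difference rounds to the 31.81 seconds listed under Duration.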
Ref_publication_date
Categorical
| Statistic | Value |
|---|---|
| Distinct | 1890 |
| Distinct (%) | 4.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |

| Value | Count |
|---|---|
| 2017-05-15 | 161 |
| 2016-04-28 | 124 |
| 2018-12-13 | 113 |
| 2018-10-11 | 113 |
| 2018-01-04 | 106 |
| Other values (1885) | 41880 |

Length
| Statistic | Value |
|---|---|
| Max length | 10 |
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |

Characters and Unicode
| Statistic | Value |
|---|---|
| Total characters | 424970 |
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |

Unique
| Statistic | Value |
|---|---|
| Unique | 52 |
| Unique (%) | 0.1% |

Sample
| Row | Value |
|---|---|
| 1st row | 2015-01-06 |
| 2nd row | 2015-01-06 |
| 3rd row | 2015-01-06 |
| 4th row | 2015-01-06 |
| 5th row | 2015-01-06 |

Common Values
| Value | Count | Frequency (%) |
| 2017-05-15 | 161 | 0.4% |
| 2016-04-28 | 124 | 0.3% |
| 2018-12-13 | 113 | 0.3% |
| 2018-10-11 | 113 | 0.3% |
| 2018-01-04 | 106 | 0.2% |
| 2017-08-23 | 105 | 0.2% |
| 2019-04-15 | 105 | 0.2% |
| 2019-03-26 | 104 | 0.2% |
| 2019-01-11 | 104 | 0.2% |
| 2018-04-24 | 101 | 0.2% |
| Other values (1880) | 41361 | 97.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 95874 | 22.6% |
| - | 84994 | 20.0% |
| 1 | 78206 | 18.4% |
| 2 | 68402 | 16.1% |
| 8 | 18889 | 4.4% |
| 9 | 18142 | 4.3% |
| 7 | 15875 | 3.7% |
| 6 | 14135 | 3.3% |
| 5 | 11366 | 2.7% |
| 3 | 9740 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 339976 | 80.0% |
| Dash Punctuation | 84994 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 95874 | 28.2% |
| 1 | 78206 | 23.0% |
| 2 | 68402 | 20.1% |
| 8 | 18889 | 5.6% |
| 9 | 18142 | 5.3% |
| 7 | 15875 | 4.7% |
| 6 | 14135 | 4.2% |
| 5 | 11366 | 3.3% |
| 3 | 9740 | 2.9% |
| 4 | 9347 | 2.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 84994 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 424970 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 95874 | 22.6% |
| - | 84994 | 20.0% |
| 1 | 78206 | 18.4% |
| 2 | 68402 | 16.1% |
| 8 | 18889 | 4.4% |
| 9 | 18142 | 4.3% |
| 7 | 15875 | 3.7% |
| 6 | 14135 | 3.3% |
| 5 | 11366 | 2.7% |
| 3 | 9740 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 424970 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 95874 | 22.6% |
| - | 84994 | 20.0% |
| 1 | 78206 | 18.4% |
| 2 | 68402 | 16.1% |
| 8 | 18889 | 4.4% |
| 9 | 18142 | 4.3% |
| 7 | 15875 | 3.7% |
| 6 | 14135 | 3.3% |
| 5 | 11366 | 2.7% |
| 3 | 9740 | 2.3% |
Cell_area_measured
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED 
| Statistic | Value |
|---|---|
| Distinct | 504 |
| Distinct (%) | 1.2% |
| Missing | 711 |
| Missing (%) | 1.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.20661691 |
| Minimum | 0.00012 |
| Maximum | 1063 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 332.1 KiB |

Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0.00012 |
| 5-th percentile | 0.04 |
| Q1 | 0.09 |
| median | 0.1 |
| Q3 | 0.12 |
| 95-th percentile | 0.3 |
| Maximum | 1063 |
| Range | 1062.9999 |
| Interquartile range (IQR) | 0.03 |

Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 5.3022697 |
| Coefficient of variation (CV) | 25.662322 |
| Kurtosis | 38637.88 |
| Mean | 0.20661691 |
| Median Absolute Deviation (MAD) | 0.01 |
| Skewness | 193.10419 |
| Sum | 8633.6942 |
| Variance | 28.114064 |
| Monotonicity | Not monotonic |

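Several of the Cell_area_measured statistics are derived from one another, so they can be cross-checked without access to the raw data. The sketch below assumes the usual definitions (mean = sum / non-missing count, variance = standard deviation squared, CV = standard deviation / mean, range = maximum − minimum, IQR = Q3 − Q1); the dictionary name `checks` is illustrative only.

```python
import math

# Values copied from the Cell_area_measured tables.
n_total, n_missing = 42_497, 711
total_sum = 8633.6942
mean = 0.20661691
std = 5.3022697
variance = 28.114064
cv = 25.662322
minimum, maximum, value_range = 0.00012, 1063, 1062.9999
q1, q3, iqr = 0.09, 0.12, 0.03

# Assumption: summary statistics are computed over non-missing values only.
n_valid = n_total - n_missing

checks = {
    "mean == sum / n_valid": math.isclose(total_sum / n_valid, mean, rel_tol=1e-5),
    "variance == std ** 2": math.isclose(std ** 2, variance, rel_tol=1e-6),
    "cv == std / mean": math.isclose(std / mean, cv, rel_tol=1e-6),
    "range == max - min": math.isclose(maximum - minimum, value_range, abs_tol=1e-4),
    "iqr == q3 - q1": math.isclose(q3 - q1, iqr, abs_tol=1e-9),
}

for label, ok in checks.items():
    print(f"{label}: {'consistent' if ok else 'MISMATCH'}")
```

All five relations hold for the reported figures, which also confirms that the mean is taken over the 41786 non-missing rows rather than all 42497 observations.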
Common Values
| Value | Count | Frequency (%) |
| 0.1 | 12395 | 29.2% |
| 0.09 | 6522 | 15.3% |
| 0.16 | 3341 | 7.9% |
| 0.04 | 1995 | 4.7% |
| 0.06 | 1492 | 3.5% |
| 0.12 | 1380 | 3.2% |
| 0.07 | 995 | 2.3% |
| 0.08 | 828 | 1.9% |
| 1 | 566 | 1.3% |
| 0.2 | 488 | 1.1% |
| Other values (494) | 11784 | 27.7% |
| (Missing) | 711 | 1.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0.00012 | 1 | < 0.1% |
| 0.0002 | 4 | < 0.1% |
| 0.000364 | 1 | < 0.1% |
| 0.0004 | 8 | < 0.1% |
| 0.0005 | 4 | < 0.1% |
| 0.0006 | 4 | < 0.1% |
| 0.0007 | 4 | < 0.1% |
| 0.000725 | 3 | < 0.1% |
| 0.001 | 3 | < 0.1% |
| 0.002 | 5 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 1063 | 1 | < 0.1% |
| 100 | 1 | < 0.1% |
| 70 | 2 | < 0.1% |
| 40 | 3 | < 0.1% |
| 36.1 | 4 | < 0.1% |
| 36 | 2 | < 0.1% |
| 35.8 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 22.4 | 1 | < 0.1% |
| 20.78 | 1 | < 0.1% |
Cell_architecture
Categorical
HIGH CORRELATION  IMBALANCE 
| Statistic | Value |
|---|---|
| Distinct | 7 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |

| Value | Count |
|---|---|
| nip | 29667 |
| pin | 12776 |
| Back contacted | 33 |
| Unknown | 8 |
| Front contacted | 7 |
| Other values (2) | 6 |

Length
| Statistic | Value |
|---|---|
| Max length | 17 |
| Median length | 3 |
| Mean length | 3.0121891 |
| Min length | 3 |

Characters and Unicode
| Statistic | Value |
|---|---|
| Total characters | 128009 |
| Distinct characters | 24 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |

Unique
| Statistic | Value |
|---|---|
| Unique | 1 |
| Unique (%) | < 0.1% |

Sample
| Row | Value |
|---|---|
| 1st row | nip |
| 2nd row | nip |
| 3rd row | nip |
| 4th row | nip |
| 5th row | nip |

Common Values
| Value | Count | Frequency (%) |
| nip | 29667 | 69.8% |
| pin | 12776 | 30.1% |
| Back contacted | 33 | 0.1% |
| Unknown | 8 | < 0.1% |
| Front contacted | 7 | < 0.1% |
| Schottky | 5 | < 0.1% |
| Pn-Heterojunction | 1 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| nip | 29667 | 69.7% |
| pin | 12776 | 30.0% |
| contacted | 40 | 0.1% |
| back | 33 | 0.1% |
| unknown | 8 | < 0.1% |
| front | 7 | < 0.1% |
| schottky | 5 | < 0.1% |
| pn-heterojunction | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 42517 | 33.2% |
| i | 42444 | 33.2% |
| p | 42443 | 33.2% |
| c | 119 | 0.1% |
| t | 99 | 0.1% |
| a | 73 | 0.1% |
| o | 62 | < 0.1% |
| k | 46 | < 0.1% |
| e | 42 | < 0.1% |
| (space) | 40 | < 0.1% |
| Other values (14) | 124 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 127913 | 99.9% |
| Uppercase Letter | 55 | < 0.1% |
| Space Separator | 40 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 42517 | 33.2% |
| i | 42444 | 33.2% |
| p | 42443 | 33.2% |
| c | 119 | 0.1% |
| t | 99 | 0.1% |
| a | 73 | 0.1% |
| o | 62 | < 0.1% |
| k | 46 | < 0.1% |
| e | 42 | < 0.1% |
| d | 40 | < 0.1% |
| Other values (6) | 28 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 33 | 60.0% |
| U | 8 | 14.5% |
| F | 7 | 12.7% |
| S | 5 | 9.1% |
| P | 1 | 1.8% |
| H | 1 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 40 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 127968 | > 99.9% |
| Common | 41 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 42517 | 33.2% |
| i | 42444 | 33.2% |
| p | 42443 | 33.2% |
| c | 119 | 0.1% |
| t | 99 | 0.1% |
| a | 73 | 0.1% |
| o | 62 | < 0.1% |
| k | 46 | < 0.1% |
| e | 42 | < 0.1% |
| d | 40 | < 0.1% |
| Other values (12) | 83 | 0.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 40 | 97.6% |
| - | 1 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 128009 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 42517 | 33.2% |
| i | 42444 | 33.2% |
| p | 42443 | 33.2% |
| c | 119 | 0.1% |
| t | 99 | 0.1% |
| a | 73 | 0.1% |
| o | 62 | < 0.1% |
| k | 46 | < 0.1% |
| e | 42 | < 0.1% |
| (space) | 40 | < 0.1% |
| Other values (14) | 124 | 0.1% |
Cell_flexible
Boolean
HIGH CORRELATION  IMBALANCE 
| Statistic | Value |
|---|---|
| Distinct | 2 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.6 KiB |

| Value | Count |
|---|---|
| False | 41425 |
| True | 1072 |

| Value | Count | Frequency (%) |
| False | 41425 | 97.5% |
| True | 1072 | 2.5% |
Module
Boolean
HIGH CORRELATION  IMBALANCE 
| Statistic | Value |
|---|---|
| Distinct | 2 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.6 KiB |

| Value | Count |
|---|---|
| False | 42151 |
| True | 346 |

| Value | Count | Frequency (%) |
| False | 42151 | 99.2% |
| True | 346 | 0.8% |
Module_area_total
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Statistic | Value |
|---|---|
| Distinct | 81 |
| Distinct (%) | 24.4% |
| Missing | 42165 |
| Missing (%) | 99.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.132743 |
| Minimum | 0.00024 |
| Maximum | 435 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 332.1 KiB |

Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0.00024 |
| 5-th percentile | 0.25 |
| Q1 | 2.6775 |
| median | 13.65 |
| Q3 | 36.1 |
| 95-th percentile | 122.275 |
| Maximum | 435 |
| Range | 434.99976 |
| Interquartile range (IQR) | 33.4225 |

Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 76.079615 |
| Coefficient of variation (CV) | 1.9441422 |
| Kurtosis | 17.122163 |
| Mean | 39.132743 |
| Median Absolute Deviation (MAD) | 12.65 |
| Skewness | 3.9701825 |
| Sum | 12992.071 |
| Variance | 5788.1079 |
| Monotonicity | Not monotonic |

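The "Distinct (%)" figure for Module_area_total (24.4%) looks large next to 81 distinct values in 42497 rows. It is reproduced if the denominator is the number of non-missing values rather than all observations. A short sketch under that assumption; `distinct_pct` is a hypothetical helper, not part of the profiling library:

```python
N_OBSERVATIONS = 42_497

def distinct_pct(n_distinct, n_missing, n_total=N_OBSERVATIONS):
    """Distinct values as a percentage of the non-missing rows (assumed definition)."""
    return 100 * n_distinct / (n_total - n_missing)

# (distinct, missing) pairs copied from the per-variable tables of this report.
columns = {
    "Module_area_total": (81, 42_165),
    "Cell_area_measured": (504, 711),
    "Ref_publication_date": (1890, 0),
}

for name, (n_distinct, n_missing) in columns.items():
    print(f"{name}: {distinct_pct(n_distinct, n_missing):.1f}%")
```

With only 332 non-missing rows in Module_area_total, 81 distinct values amount to roughly a quarter of the available data, which is worth keeping in mind when reading percentages for sparsely populated columns.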
Common Values
| Value | Count | Frequency (%) |
| 0.25 | 40 | 0.1% |
| 25 | 35 | 0.1% |
| 100 | 25 | 0.1% |
| 12 | 19 | < 0.1% |
| 0.5024 | 12 | < 0.1% |
| 10 | 12 | < 0.1% |
| 10.8 | 10 | < 0.1% |
| 16 | 10 | < 0.1% |
| 36.1 | 9 | < 0.1% |
| 1 | 9 | < 0.1% |
| Other values (71) | 151 | 0.4% |
| (Missing) | 42165 | 99.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0.00024 | 1 | < 0.1% |
| 0.000288 | 1 | < 0.1% |
| 0.00031 | 1 | < 0.1% |
| 0.00096 | 1 | < 0.1% |
| 0.09 | 1 | < 0.1% |
| 0.12 | 5 | < 0.1% |
| 0.2 | 1 | < 0.1% |
| 0.24 | 1 | < 0.1% |
| 0.25 | 40 | 0.1% |
| 0.5024 | 12 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 435 | 8 | < 0.1% |
| 354.45 | 2 | < 0.1% |
| 231.04 | 1 | < 0.1% |
| 168.75 | 1 | < 0.1% |
| 156.25 | 4 | < 0.1% |
| 149.5 | 1 | < 0.1% |
| 100 | 25 | 0.1% |
| 91.8 | 2 | < 0.1% |
| 80.55 | 8 | < 0.1% |
| 80 | 1 | < 0.1% |
Substrate_stack_sequence
Categorical
HIGH CARDINALITY  IMBALANCE 
| Statistic | Value |
|---|---|
| Distinct | 194 |
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |

| Value | Count |
|---|---|
| SLG \| FTO | 26261 |
| SLG \| ITO | 14856 |
| PET \| ITO | 374 |
| PEN \| ITO | 233 |
| PET \| IZO | 62 |
| Other values (189) | 711 |

Length
| Statistic | Value |
|---|---|
| Max length | 90 |
| Median length | 9 |
| Mean length | 9.062875 |
| Min length | 2 |

Characters and Unicode
| Statistic | Value |
|---|---|
| Total characters | 385145 |
| Distinct characters | 58 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |

Unique
| Statistic | Value |
|---|---|
| Unique | 84 |
| Unique (%) | 0.2% |

Sample
| Row | Value |
|---|---|
| 1st row | SLG \| FTO |
| 2nd row | SLG \| FTO |
| 3rd row | SLG \| FTO |
| 4th row | SLG \| FTO |
| 5th row | SLG \| FTO |

Common Values
| Value | Count | Frequency (%) |
| SLG \| FTO | 26261 | 61.8% |
| SLG \| ITO | 14856 | 35.0% |
| PET \| ITO | 374 | 0.9% |
| PEN \| ITO | 233 | 0.5% |
| PET \| IZO | 62 | 0.1% |
| SLG \| AZO | 45 | 0.1% |
| SLG | 39 | 0.1% |
| Ti-foil | 30 | 0.1% |
| PET | 30 | 0.1% |
| PET \| Ag-grid | 28 | 0.1% |
| Other values (184) | 539 | 1.3% |
Words
| Value | Count | Frequency (%) |
| (empty) | 42524 | 33.3% |
| slg | 41391 | 32.4% |
| fto | 26277 | 20.6% |
| ito | 15558 | 12.2% |
| pet | 581 | 0.5% |
| pen | 271 | 0.2% |
| azo | 99 | 0.1% |
| graphene | 98 | 0.1% |
| izo | 73 | 0.1% |
| ag-nw | 50 | < 0.1% |
| Other values (124) | 708 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 85133 | 22.1% |
| T | 42625 | 11.1% |
| \| | 42524 | 11.0% |
| O | 42173 | 10.9% |
| S | 41558 | 10.8% |
| G | 41525 | 10.8% |
| L | 41395 | 10.7% |
| F | 26298 | 6.8% |
| I | 15666 | 4.1% |
| P | 1005 | 0.3% |
| Other values (48) | 5243 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 254376 | 66.0% |
| Space Separator | 85133 | 22.1% |
| Math Symbol | 42524 | 11.0% |
| Lowercase Letter | 2634 | 0.7% |
| Dash Punctuation | 223 | 0.1% |
| Decimal Number | 168 | < 0.1% |
| Other Punctuation | 87 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 322 | 12.2% |
| i | 294 | 11.2% |
| n | 289 | 11.0% |
| r | 215 | 8.2% |
| g | 198 | 7.5% |
| a | 184 | 7.0% |
| l | 164 | 6.2% |
| o | 141 | 5.4% |
| p | 134 | 5.1% |
| h | 120 | 4.6% |
| Other values (13) | 573 | 21.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 42625 | 16.8% |
| O | 42173 | 16.6% |
| S | 41558 | 16.3% |
| G | 41525 | 16.3% |
| L | 41395 | 16.3% |
| F | 26298 | 10.3% |
| I | 15666 | 6.2% |
| P | 1005 | 0.4% |
| E | 933 | 0.4% |
| A | 371 | 0.1% |
| Other values (12) | 827 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 60 | 35.7% |
| 2 | 55 | 32.7% |
| 0 | 18 | 10.7% |
| 6 | 15 | 8.9% |
| 1 | 7 | 4.2% |
| 8 | 7 | 4.2% |
| 4 | 5 | 3.0% |
| 5 | 1 | 0.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 45 | 51.7% |
| ; | 42 | 48.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 85133 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 42524 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 223 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 257010 | 66.7% |
| Common | 128135 | 33.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 42625 | 16.6% |
| O | 42173 | 16.4% |
| S | 41558 | 16.2% |
| G | 41525 | 16.2% |
| L | 41395 | 16.1% |
| F | 26298 | 10.2% |
| I | 15666 | 6.1% |
| P | 1005 | 0.4% |
| E | 933 | 0.4% |
| A | 371 | 0.1% |
| Other values (35) | 3461 | 1.3% |
Common
| Value | Count | Frequency (%) |
| (space) | 85133 | 66.4% |
| \| | 42524 | 33.2% |
| - | 223 | 0.2% |
| 3 | 60 | < 0.1% |
| 2 | 55 | < 0.1% |
| : | 45 | < 0.1% |
| ; | 42 | < 0.1% |
| 0 | 18 | < 0.1% |
| 6 | 15 | < 0.1% |
| 1 | 7 | < 0.1% |
| Other values (3) | 13 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 385145 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 85133 | 22.1% |
| T | 42625 | 11.1% |
| \| | 42524 | 11.0% |
| O | 42173 | 10.9% |
| S | 41558 | 10.8% |
| G | 41525 | 10.8% |
| L | 41395 | 10.7% |
| F | 26298 | 6.8% |
| I | 15666 | 4.1% |
| P | 1005 | 0.3% |
| Other values (48) | 5243 | 1.4% |
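The category names in the character tables (Uppercase Letter, Space Separator, Math Symbol, and so on) match Unicode general categories, which is why the `|` layer separator in stack sequences is counted as a Math Symbol. The sketch below shows how such a breakdown can be produced with the Python standard library; it is an illustration of the idea, not the profiling library's own code, and `category_counts` and `CATEGORY_NAMES` are hypothetical names.

```python
import unicodedata
from collections import Counter

# Readable labels for the general-category codes that occur in this report.
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Sm": "Math Symbol",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# Three values taken from the Substrate_stack_sequence Common Values table.
sample = ["SLG | FTO", "PET | Ag-grid", "Ti-foil"]
for name, count in category_counts(sample).most_common():
    print(f"{name}: {count}")
```

Because every multi-layer stack contains at least one ` | ` separator, spaces and the pipe character dominate the "Common" script counts for this column.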
ETL_stack_sequence
Categorical
HIGH CARDINALITY  IMBALANCE 
| Statistic | Value |
|---|---|
| Distinct | 1444 |
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |

| Value | Count |
|---|---|
| TiO2-c \| TiO2-mp | 11705 |
| TiO2-c | 6882 |
| PCBM-60 | 3281 |
| PCBM-60 \| BCP | 2662 |
| SnO2-np | 1652 |
| Other values (1439) | 16315 |

Length
| Statistic | Value |
|---|---|
| Max length | 190 |
| Median length | 185 |
| Mean length | 12.302963 |
| Min length | 2 |

Characters and Unicode
| Statistic | Value |
|---|---|
| Total characters | 522839 |
| Distinct characters | 80 |
| Distinct categories | 10 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |

Unique
| Statistic | Value |
|---|---|
| Unique | 532 |
| Unique (%) | 1.3% |

Sample
| Row | Value |
|---|---|
| 1st row | TiO2-c \| TiO2-mp |
| 2nd row | TiO2-c \| TiO2-mp |
| 3rd row | TiO2-c \| TiO2-mp |
| 4th row | TiO2-c \| TiO2-mp |
| 5th row | TiO2-c \| TiO2-mp |

Common Values
| Value | Count | Frequency (%) |
| TiO2-c \| TiO2-mp | 11705 | 27.5% |
| TiO2-c | 6882 | 16.2% |
| PCBM-60 | 3281 | 7.7% |
| PCBM-60 \| BCP | 2662 | 6.3% |
| SnO2-np | 1652 | 3.9% |
| C60 \| BCP | 1508 | 3.5% |
| SnO2-c | 1399 | 3.3% |
| TiO2-c \| TiO2-mp \| ZrO2-mp | 644 | 1.5% |
| ZnO-c | 484 | 1.1% |
| PCBM-60 \| C60 \| BCP | 461 | 1.1% |
| Other values (1434) | 11819 | 27.8% |
Words
| Value | Count | Frequency (%) |
| (empty) | 28790 | 28.5% |
| tio2-c | 22073 | 21.8% |
| tio2-mp | 13409 | 13.3% |
| pcbm-60 | 10553 | 10.4% |
| bcp | 5249 | 5.2% |
| c60 | 3548 | 3.5% |
| sno2-c | 2267 | 2.2% |
| sno2-np | 2070 | 2.0% |
| zno-c | 1080 | 1.1% |
| zno-np | 869 | 0.9% |
| Other values (875) | 11217 | 11.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 58628 | 11.2% |
| - | 57921 | 11.1% |
| O | 46434 | 8.9% |
| 2 | 43503 | 8.3% |
| i | 39041 | 7.5% |
| T | 37804 | 7.2% |
| \| | 28778 | 5.5% |
| c | 26248 | 5.0% |
| C | 21542 | 4.1% |
| p | 19495 | 3.7% |
| Other values (70) | 143445 | 27.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 171201 | 32.7% |
| Lowercase Letter | 128149 | 24.5% |
| Decimal Number | 76479 | 14.6% |
| Space Separator | 58628 | 11.2% |
| Dash Punctuation | 57965 | 11.1% |
| Math Symbol | 28778 | 5.5% |
| Other Punctuation | 1004 | 0.2% |
| Close Punctuation | 317 | 0.1% |
| Open Punctuation | 317 | 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 46434 | 27.1% |
| T | 37804 | 22.1% |
| C | 21542 | 12.6% |
| P | 18213 | 10.6% |
| B | 17341 | 10.1% |
| M | 11500 | 6.7% |
| S | 5669 | 3.3% |
| Z | 3800 | 2.2% |
| A | 1802 | 1.1% |
| I | 1427 | 0.8% |
| Other values (16) | 5669 | 3.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 39041 | 30.5% |
| c | 26248 | 20.5% |
| p | 19495 | 15.2% |
| m | 15436 | 12.0% |
| n | 14197 | 11.1% |
| e | 2193 | 1.7% |
| r | 1733 | 1.4% |
| o | 1320 | 1.0% |
| a | 1311 | 1.0% |
| h | 1274 | 1.0% |
| Other values (15) | 5901 | 4.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 43503 | 56.9% |
| 0 | 15396 | 20.1% |
| 6 | 14774 | 19.3% |
| 3 | 1069 | 1.4% |
| 1 | 595 | 0.8% |
| 7 | 413 | 0.5% |
| 4 | 397 | 0.5% |
| 5 | 203 | 0.3% |
| 8 | 91 | 0.1% |
| 9 | 38 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 643 | 64.0% |
| , | 141 | 14.0% |
| : | 86 | 8.6% |
| @ | 60 | 6.0% |
| . | 37 | 3.7% |
| ′ | 24 | 2.4% |
| / | 8 | 0.8% |
| * | 5 | 0.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 282 | 89.0% |
| ] | 31 | 9.8% |
| } | 4 | 1.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 282 | |
| [ | 31 | 9.8% |
| { | 4 | 1.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 57921 | |
| ‐ | 44 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 58628 | |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 28778 | |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 299350 | |
| Common | 223489 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 46434 | |
| i | 39041 | |
| T | 37804 | |
| c | 26248 | |
| C | 21542 | |
| p | 19495 | |
| P | 18213 | 6.1% |
| B | 17341 | 5.8% |
| m | 15436 | 5.2% |
| n | 14197 | 4.7% |
| Other values (41) | 43599 |
Common
| Value | Count | Frequency (%) |
| (space) | 58628 | |
| - | 57921 | |
| 2 | 43503 | |
| \| | 28778 | |
| 0 | 15396 | 6.9% |
| 6 | 14774 | 6.6% |
| 3 | 1069 | 0.5% |
| ; | 643 | 0.3% |
| 1 | 595 | 0.3% |
| 7 | 413 | 0.2% |
| Other values (19) | 1769 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 522771 | |
| Punctuation | 68 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 58628 | |
| - | 57921 | |
| O | 46434 | 8.9% |
| 2 | 43503 | 8.3% |
| i | 39041 | 7.5% |
| T | 37804 | 7.2% |
| \| | 28778 | 5.5% |
| c | 26248 | 5.0% |
| C | 21542 | 4.1% |
| p | 19495 | 3.7% |
| Other values (68) | 143377 |
Punctuation
| Value | Count | Frequency (%) |
| ‐ | 44 | |
| ′ | 24 |
ETL_thickness
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 1419 |
|---|---|
| Distinct (%) | 7.6% |
| Missing | 23933 |
| Missing (%) | 56.3% |
| Memory size | 332.1 KiB |
| 50.0 | 1111 |
|---|---|
| 40.0 | 906 |
| 30.0 | 631 |
| 60.0 | 522 |
| 20.0 | 420 |
| Other values (1414) |
Length
| Max length | 46 |
|---|---|
| Median length | 32 |
| Mean length | 9.3584357 |
| Min length | 3 |
Characters and Unicode
| Total characters | 173730 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 543 |
|---|---|
| Unique (%) | 2.9% |
Sample
| 1st row | 65.0 \| nan |
|---|---|
| 2nd row | 65.0 \| nan |
| 3rd row | 65.0 \| nan |
| 4th row | 65.0 \| nan |
| 5th row | 65.0 \| nan |
Common Values
| Value | Count | Frequency (%) |
| 50.0 | 1111 | 2.6% |
| 40.0 | 906 | 2.1% |
| 30.0 | 631 | 1.5% |
| 60.0 | 522 | 1.2% |
| 20.0 | 420 | 1.0% |
| nan \| 200.0 | 367 | 0.9% |
| 30.0 \| nan | 344 | 0.8% |
| 80.0 | 313 | 0.7% |
| 50.0 \| nan | 313 | 0.7% |
| 100.0 | 303 | 0.7% |
| Other values (1409) | 13334 | |
| (Missing) | 23933 |
Words
| Value | Count | Frequency (%) |
| (separator) | 14370 | |
| nan | 6057 | |
| 50.0 | 2963 | 6.3% |
| 30.0 | 2519 | 5.3% |
| 40.0 | 2073 | 4.4% |
| 20.0 | 1844 | 3.9% |
| 60.0 | 1275 | 2.7% |
| 8.0 | 1170 | 2.5% |
| 200.0 | 1170 | 2.5% |
| 10.0 | 1163 | 2.5% |
| Other values (374) | 12700 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 52415 | |
| (space) | 28740 | |
| . | 26873 | |
| \| | 14370 | 8.3% |
| n | 12122 | 7.0% |
| 5 | 8225 | 4.7% |
| a | 6057 | 3.5% |
| 1 | 5641 | 3.2% |
| 2 | 5097 | 2.9% |
| 3 | 4300 | 2.5% |
| Other values (9) | 9890 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 85552 | |
| Space Separator | 28740 | 16.5% |
| Other Punctuation | 26873 | 15.5% |
| Lowercase Letter | 18195 | 10.5% |
| Math Symbol | 14370 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 52415 | |
| 5 | 8225 | 9.6% |
| 1 | 5641 | 6.6% |
| 2 | 5097 | 6.0% |
| 3 | 4300 | 5.0% |
| 4 | 3384 | 4.0% |
| 8 | 2399 | 2.8% |
| 6 | 2251 | 2.6% |
| 7 | 1431 | 1.7% |
| 9 | 409 | 0.5% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 12122 | |
| a | 6057 | |
| u | 4 | < 0.1% |
| k | 4 | < 0.1% |
| o | 4 | < 0.1% |
| w | 4 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 28740 | |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 26873 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 14370 | |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 155535 | |
| Latin | 18195 | 10.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 52415 | |
| (space) | 28740 | |
| . | 26873 | |
| \| | 14370 | 9.2% |
| 5 | 8225 | 5.3% |
| 1 | 5641 | 3.6% |
| 2 | 5097 | 3.3% |
| 3 | 4300 | 2.8% |
| 4 | 3384 | 2.2% |
| 8 | 2399 | 1.5% |
| Other values (3) | 4091 | 2.6% |
Latin
| Value | Count | Frequency (%) |
| n | 12122 | |
| a | 6057 | |
| u | 4 | < 0.1% |
| k | 4 | < 0.1% |
| o | 4 | < 0.1% |
| w | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 173730 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 52415 | |
| (space) | 28740 | |
| . | 26873 | |
| \| | 14370 | 8.3% |
| n | 12122 | 7.0% |
| 5 | 8225 | 4.7% |
| a | 6057 | 3.5% |
| 1 | 5641 | 3.2% |
| 2 | 5097 | 2.9% |
| 3 | 4300 | 2.5% |
| Other values (9) | 9890 | 5.7% |
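The Common Values table above suggests that ETL_thickness stores one value per layer, joined with ` | `, with `nan` (and a handful of other non-numeric tokens) where a layer's thickness was not reported. A minimal sketch of turning such a string into numbers, assuming that reading is right; `parse_layer_thicknesses` is a hypothetical helper introduced here for illustration and is not part of the profiled dataset or of any library:

```python
import math


def parse_layer_thicknesses(raw):
    """Split a string such as 'nan | 200.0' into one float per layer.

    Assumption: '|' separates layers. Any token that cannot be read as a
    number is mapped to NaN so that the layer count is preserved.
    """
    values = []
    for token in raw.split("|"):
        token = token.strip()
        try:
            values.append(float(token))  # float('nan') also lands here
        except ValueError:
            values.append(math.nan)
    return values


# Example with two values taken from the Common Values table.
print(parse_layer_thicknesses("30.0 | nan"))
print(parse_layer_thicknesses("50.0"))
```

Keeping unparseable tokens as NaN rather than dropping them preserves the position of each layer, which matters if the result is later aligned with a stack-sequence column.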
ETL_additives_compounds
Categorical
HIGH CARDINALITY  IMBALANCE  MISSING 
| Distinct | 447 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 3293 |
| Missing (%) | 7.7% |
| Memory size | 332.1 KiB |
| Unknown | |
|---|---|
| Unknown | TiCl4 | 1000 |
| Undoped | 530 |
| Unknown | Li-TFSI | 529 |
| TiCl4 | 472 |
| Other values (442) | 2984 |
Length
| Max length | 51 |
|---|---|
| Median length | 7 |
| Mean length | 7.7830323 |
| Min length | 1 |
Characters and Unicode
| Total characters | 305126 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 132 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Unknown |
|---|---|
| 2nd row | Unknown |
| 3rd row | Unknown |
| 4th row | Unknown |
| 5th row | Unknown |
Common Values
| Value | Count | Frequency (%) |
| Unknown | 33689 | |
| Unknown \| TiCl4 | 1000 | 2.4% |
| Undoped | 530 | 1.2% |
| Unknown \| Li-TFSI | 529 | 1.2% |
| TiCl4 | 472 | 1.1% |
| TiCl4 \| Unknown | 334 | 0.8% |
| Undoped \| Undoped | 313 | 0.7% |
| Nb | 78 | 0.2% |
| nan \| TiCl4 | 75 | 0.2% |
| Li-TFSI | 45 | 0.1% |
| Other values (437) | 2139 | 5.0% |
| (Missing) | 3293 | 7.7% |
Words
| Value | Count | Frequency (%) |
| unknown | 36562 | |
| (separator) | 3704 | 7.9% |
| ticl4 | 2206 | 4.7% |
| undoped | 1366 | 2.9% |
| li-tfsi | 688 | 1.5% |
| nan | 219 | 0.5% |
| nb | 151 | 0.3% |
| graphene | 58 | 0.1% |
| mg | 55 | 0.1% |
| in | 45 | 0.1% |
| Other values (302) | 1972 | 4.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 112111 | |
| o | 38274 | 12.5% |
| U | 37959 | 12.4% |
| w | 36568 | 12.0% |
| k | 36562 | 12.0% |
| (space) | 7809 | 2.6% |
| \| | 3704 | 1.2% |
| i | 3495 | 1.1% |
| T | 3190 | 1.0% |
| d | 2891 | 0.9% |
| Other values (63) | 22563 | 7.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 239385 | |
| Uppercase Letter | 49818 | 16.3% |
| Space Separator | 7822 | 2.6% |
| Math Symbol | 3704 | 1.2% |
| Decimal Number | 2974 | 1.0% |
| Dash Punctuation | 1001 | 0.3% |
| Other Punctuation | 306 | 0.1% |
| Close Punctuation | 58 | < 0.1% |
| Open Punctuation | 58 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 37959 | |
| T | 3190 | 6.4% |
| C | 2662 | 5.3% |
| S | 964 | 1.9% |
| I | 938 | 1.9% |
| F | 853 | 1.7% |
| L | 785 | 1.6% |
| N | 416 | 0.8% |
| A | 365 | 0.7% |
| P | 267 | 0.5% |
| Other values (15) | 1419 | 2.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 112111 | |
| o | 38274 | 16.0% |
| w | 36568 | 15.3% |
| k | 36562 | 15.3% |
| i | 3495 | 1.5% |
| d | 2891 | 1.2% |
| l | 2718 | 1.1% |
| e | 2030 | 0.8% |
| p | 1608 | 0.7% |
| a | 762 | 0.3% |
| Other values (14) | 2366 | 1.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 2288 | |
| 2 | 259 | 8.7% |
| 3 | 174 | 5.9% |
| 0 | 88 | 3.0% |
| 1 | 60 | 2.0% |
| 6 | 55 | 1.8% |
| 5 | 29 | 1.0% |
| 9 | 10 | 0.3% |
| 7 | 8 | 0.3% |
| 8 | 3 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 198 | |
| : | 39 | 12.7% |
| @ | 29 | 9.5% |
| · | 14 | 4.6% |
| , | 10 | 3.3% |
| . | 6 | 2.0% |
| * | 5 | 1.6% |
| ′ | 5 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7809 | |
| (non-ASCII space) | 13 | 0.2% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 3704 | |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1001 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 58 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 58 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 289203 | |
| Common | 15923 | 5.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 112111 | |
| o | 38274 | 13.2% |
| U | 37959 | 13.1% |
| w | 36568 | 12.6% |
| k | 36562 | 12.6% |
| i | 3495 | 1.2% |
| T | 3190 | 1.1% |
| d | 2891 | 1.0% |
| l | 2718 | 0.9% |
| C | 2662 | 0.9% |
| Other values (39) | 12773 | 4.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 7809 | |
| \| | 3704 | |
| 4 | 2288 | 14.4% |
| - | 1001 | 6.3% |
| 2 | 259 | 1.6% |
| ; | 198 | 1.2% |
| 3 | 174 | 1.1% |
| 0 | 88 | 0.6% |
| 1 | 60 | 0.4% |
| ) | 58 | 0.4% |
| Other values (14) | 284 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 305094 | |
| None | 27 | < 0.1% |
| Punctuation | 5 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 112111 | |
| o | 38274 | 12.5% |
| U | 37959 | 12.4% |
| w | 36568 | 12.0% |
| k | 36562 | 12.0% |
| (space) | 7809 | 2.6% |
| \| | 3704 | 1.2% |
| i | 3495 | 1.1% |
| T | 3190 | 1.0% |
| d | 2891 | 0.9% |
| Other values (60) | 22531 | 7.4% |
None
| Value | Count | Frequency (%) |
| · | 14 | |
| (non-ASCII space) | 13 | |
Punctuation
| Value | Count | Frequency (%) |
| ′ | 5 |
ETL_deposition_procedure
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 368 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
| Spin-coating | |
|---|---|
| Spin-coating | Spin-coating | |
| Spray-pyrolys | Spin-coating | |
| Evaporation | Evaporation | |
| Spin-coating | Evaporation | |
| Other values (363) |
Length
| Max length | 155 |
|---|---|
| Median length | 102 |
| Mean length | 21.915618 |
| Min length | 3 |
Characters and Unicode
| Total characters | 931348 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 68 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Spray-pyrolys \| Spin-coating |
|---|---|
| 2nd row | Spray-pyrolys \| Spin-coating |
| 3rd row | Spray-pyrolys \| Spin-coating |
| 4th row | Spray-pyrolys \| Spin-coating |
| 5th row | Spray-pyrolys \| Spin-coating |
Common Values
| Value | Count | Frequency (%) |
| Spin-coating | 12263 | |
| Spin-coating \| Spin-coating | 11725 | |
| Spray-pyrolys \| Spin-coating | 4655 | 11.0% |
| Evaporation \| Evaporation | 1715 | 4.0% |
| Spin-coating \| Evaporation | 1496 | 3.5% |
| CBD | 1127 | 2.7% |
| Spray-pyrolys | 923 | 2.2% |
| Spin-coating \| Evaporation \| Evaporation | 722 | 1.7% |
| Unknown | 685 | 1.6% |
| Spin-coating \| Spin-coating \| Spin-coating | 531 | 1.2% |
| Other values (358) | 6655 |
Words
| Value | Count | Frequency (%) |
| spin-coating | 48497 | |
| (separator) | 29381 | |
| evaporation | 7590 | 7.3% |
| spray-pyrolys | 6645 | 6.4% |
| cbd | 2070 | 2.0% |
| printing | 1729 | 1.7% |
| screen | 1728 | 1.7% |
| hydrothermal | 1025 | 1.0% |
| unknown | 902 | 0.9% |
| ald | 810 | 0.8% |
| Other values (86) | 3918 | 3.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 115684 | |
| i | 111704 | |
| o | 75063 | 8.1% |
| a | 73557 | 7.9% |
| p | 73518 | 7.9% |
| t | 62756 | 6.7% |
| (space) | 61798 | 6.6% |
| S | 57628 | 6.2% |
| - | 56110 | 6.0% |
| g | 52539 | 5.6% |
| Other values (37) | 190991 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 705136 | |
| Uppercase Letter | 78144 | 8.4% |
| Space Separator | 61798 | 6.6% |
| Dash Punctuation | 56110 | 6.0% |
| Math Symbol | 30160 | 3.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 115684 | |
| i | 111704 | |
| o | 75063 | |
| a | 73557 | |
| p | 73518 | |
| t | 62756 | |
| g | 52539 | |
| c | 51839 | |
| r | 28493 | 4.0% |
| y | 21197 | 3.0% |
| Other values (16) | 38786 | 5.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 57628 | |
| E | 7963 | 10.2% |
| D | 3888 | 5.0% |
| C | 2199 | 2.8% |
| B | 2085 | 2.7% |
| H | 1033 | 1.3% |
| U | 921 | 1.2% |
| L | 853 | 1.1% |
| A | 842 | 1.1% |
| M | 226 | 0.3% |
| Other values (7) | 506 | 0.6% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 28602 | |
| > | 1558 | 5.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 61798 | |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 56110 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 783280 | |
| Common | 148068 | 15.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 115684 | |
| i | 111704 | |
| o | 75063 | |
| a | 73557 | |
| p | 73518 | |
| t | 62756 | |
| S | 57628 | |
| g | 52539 | |
| c | 51839 | |
| r | 28493 | 3.6% |
| Other values (33) | 80499 |
Common
| Value | Count | Frequency (%) |
| (space) | 61798 | |
| - | 56110 | |
| \| | 28602 | |
| > | 1558 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 931348 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 115684 | |
| i | 111704 | |
| o | 75063 | 8.1% |
| a | 73557 | 7.9% |
| p | 73518 | 7.9% |
| t | 62756 | 6.7% |
| (space) | 61798 | 6.6% |
| S | 57628 | 6.2% |
| - | 56110 | 6.0% |
| g | 52539 | 5.6% |
| Other values (37) | 190991 |
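The ETL_deposition_procedure values above look like one procedure per layer joined with ` | `. The Math Symbol table also counts 1558 `>` characters, which would be consistent with a `>>` marker between successive steps applied to the same layer, although no value containing it is shown in this report. A minimal sketch under both assumptions; `split_procedure` is a hypothetical helper named here for illustration only:

```python
def split_procedure(raw):
    """Return a list of layers, each a list of deposition steps.

    Assumptions (not confirmed by the report itself):
      * '|' separates layers in the stack;
      * '>>' separates successive steps within one layer.
    """
    layers = []
    for layer in raw.split("|"):
        steps = [step.strip() for step in layer.split(">>") if step.strip()]
        layers.append(steps)
    return layers


# Values taken from the Common Values table.
print(split_procedure("Spray-pyrolys | Spin-coating"))
print(split_procedure("Spin-coating | Evaporation | Evaporation"))
```

Splitting into nested lists keeps the layer order intact, so the result can be compared position by position with other stack-style columns.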
ETL_surface_treatment_before_next_deposition_step
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 13 |
|---|---|
| Distinct (%) | 7.8% |
| Missing | 42330 |
| Missing (%) | 99.6% |
| Memory size | 332.1 KiB |
| UV-Ozone | |
|---|---|
| Plasma | |
| UV | |
| Ozone | 10 |
| O2 plasma | 5 |
| Other values (8) |
Length
| Max length | 30 |
|---|---|
| Median length | 8 |
| Mean length | 8.1616766 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1363 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | UV |
|---|---|
| 2nd row | UV |
| 3rd row | Ozone |
| 4th row | Plasma |
| 5th row | Water |
Common Values
| Value | Count | Frequency (%) |
| UV-Ozone | 108 | 0.3% |
| Plasma | 12 | < 0.1% |
| UV | 11 | < 0.1% |
| Ozone | 10 | < 0.1% |
| O2 plasma | 5 | < 0.1% |
| ZnAl-LDH and thermal annealing | 4 | < 0.1% |
| Wash with IPA | 4 | < 0.1% |
| Washed with methanol | 3 | < 0.1% |
| Water | 2 | < 0.1% |
| CO2 | 2 | < 0.1% |
| Other values (3) | 6 | < 0.1% |
| (Missing) | 42330 |
Words
| Value | Count | Frequency (%) |
| uv-ozone | 108 | |
| plasma | 19 | 9.3% |
| uv | 11 | 5.4% |
| ozone | 10 | 4.9% |
| with | 7 | 3.4% |
| o2 | 5 | 2.5% |
| znal-ldh | 4 | 2.0% |
| and | 4 | 2.0% |
| thermal | 4 | 2.0% |
| annealing | 4 | 2.0% |
| Other values (11) | 28 | 13.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 145 | |
| e | 142 | |
| O | 125 | |
| o | 123 | |
| U | 119 | |
| V | 119 | |
| z | 118 | |
| - | 112 | |
| a | 68 | 5.0% |
| (space) | 37 | 2.7% |
| Other values (24) | 255 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 781 | |
| Uppercase Letter | 424 | |
| Dash Punctuation | 112 | 8.2% |
| Space Separator | 37 | 2.7% |
| Decimal Number | 9 | 0.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 145 | |
| e | 142 | |
| o | 123 | |
| z | 118 | |
| a | 68 | |
| l | 34 | 4.4% |
| s | 26 | 3.3% |
| m | 26 | 3.3% |
| h | 23 | 2.9% |
| t | 20 | 2.6% |
| Other values (8) | 56 | 7.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 125 | |
| U | 119 | |
| V | 119 | |
| P | 16 | 3.8% |
| W | 9 | 2.1% |
| H | 8 | 1.9% |
| A | 8 | 1.9% |
| D | 4 | 0.9% |
| L | 4 | 0.9% |
| Z | 4 | 0.9% |
| Other values (3) | 8 | 1.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 112 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 37 | |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 9 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1205 | |
| Common | 158 | 11.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 145 | |
| e | 142 | |
| O | 125 | |
| o | 123 | |
| U | 119 | |
| V | 119 | |
| z | 118 | |
| a | 68 | 5.6% |
| l | 34 | 2.8% |
| s | 26 | 2.2% |
| Other values (21) | 186 |
Common
| Value | Count | Frequency (%) |
| - | 112 | |
| (space) | 37 | 23.4% |
| 2 | 9 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1363 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 145 | |
| e | 142 | |
| O | 125 | |
| o | 123 | |
| U | 119 | |
| V | 119 | |
| z | 118 | |
| - | 112 | |
| a | 68 | 5.0% |
| (space) | 37 | 2.7% |
| Other values (24) | 255 |
Perovskite_dimension_2D
Boolean
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.6 KiB |
| False | |
|---|---|
| True | 1004 |
| Value | Count | Frequency (%) |
| False | 41493 | 97.6% |
| True | 1004 | 2.4% |
Perovskite_dimension_3D
Boolean
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.6 KiB |
| True | |
|---|---|
| False | 1352 |
| Value | Count | Frequency (%) |
| True | 41145 | 96.8% |
| False | 1352 | 3.2% |
Perovskite_dimension_3D_with_2D_capping_layer
Boolean
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.6 KiB |
| False | |
|---|---|
| True | 190 |
| Value | Count | Frequency (%) |
| False | 42307 | 99.6% |
| True | 190 | 0.4% |
Perovskite_composition_perovskite_ABC3_structure
Boolean
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.6 KiB |
| True | |
|---|---|
| False | 788 |
| Value | Count | Frequency (%) |
| True | 41709 | 98.1% |
| False | 788 | 1.9% |
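The Frequency (%) values in the Boolean tables above are consistent with each count divided by the number of observations (42497, from the dataset statistics) and rounded to one decimal place. A small sketch of that arithmetic; `frequency_percent` is a hypothetical helper written for this illustration, and the rounding rule is an assumption inferred from the printed values:

```python
N_OBSERVATIONS = 42497  # "Number of observations" in the dataset statistics


def frequency_percent(count, total=N_OBSERVATIONS):
    """Share of `count` in `total`, as a percentage with one decimal."""
    return round(100 * count / total, 1)


# Perovskite_dimension_2D: True = 1004, False = 41493
print(frequency_percent(1004))   # 2.4
print(frequency_percent(41493))  # 97.6
```

The same calculation can be used to fill in a percentage wherever the report shows a count without one, as long as the denominator for that table is known.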
Perovskite_composition_a_ions
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 264 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 31 |
| Missing (%) | 0.1% |
| Memory size | 332.1 KiB |
| MA | |
|---|---|
| FA; MA | |
| Cs; FA; MA | |
| Cs | 2070 |
| Cs; FA | 1061 |
| Other values (259) |
Length
| Max length | 26 |
|---|---|
| Median length | 2 |
| Mean length | 3.493642 |
| Min length | 1 |
Characters and Unicode
| Total characters | 148361 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 96 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Cs |
|---|---|
| 2nd row | Cs |
| 3rd row | Cs |
| 4th row | Cs |
| 5th row | Cs |
Common Values
| Value | Count | Frequency (%) |
| MA | 28620 | |
| FA; MA | 3985 | 9.4% |
| Cs; FA; MA | 3823 | 9.0% |
| Cs | 2070 | 4.9% |
| Cs; FA | 1061 | 2.5% |
| FA | 991 | 2.3% |
| BA; MA | 218 | 0.5% |
| Cs; MA | 132 | 0.3% |
| (PEA); MA | 130 | 0.3% |
| GU; MA | 56 | 0.1% |
| Other values (254) | 1380 | 3.2% |
Words
| Value | Count | Frequency (%) |
| ma | 37756 | |
| fa | 10382 | 18.0% |
| cs | 7590 | 13.1% |
| ba | 376 | 0.7% |
| (separator) | 277 | 0.5% |
| pea | 271 | 0.5% |
| ag | 116 | 0.2% |
| gu | 107 | 0.2% |
| 5-ava | 55 | 0.1% |
| ha | 48 | 0.1% |
| Other values (118) | 776 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 49583 | |
| M | 37913 | |
| (space) | 15288 | 10.3% |
| ; | 14734 | 9.9% |
| F | 10436 | 7.0% |
| C | 7657 | 5.2% |
| s | 7592 | 5.1% |
| ( | 867 | 0.6% |
| ) | 867 | 0.6% |
| P | 523 | 0.4% |
| Other values (48) | 2901 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 108037 | |
| Space Separator | 15288 | 10.3% |
| Other Punctuation | 14736 | 9.9% |
| Lowercase Letter | 7965 | 5.4% |
| Open Punctuation | 867 | 0.6% |
| Close Punctuation | 867 | 0.6% |
| Math Symbol | 277 | 0.2% |
| Decimal Number | 250 | 0.2% |
| Dash Punctuation | 74 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 49583 | |
| M | 37913 | |
| F | 10436 | 9.7% |
| C | 7657 | 7.1% |
| P | 523 | 0.5% |
| B | 483 | 0.4% |
| E | 464 | 0.4% |
| H | 177 | 0.2% |
| D | 128 | 0.1% |
| G | 124 | 0.1% |
| Other values (12) | 549 | 0.5% |
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 7592 | |
| g | 116 | 1.5% |
| b | 44 | 0.6% |
| h | 31 | 0.4% |
| a | 28 | 0.4% |
| y | 27 | 0.3% |
| i | 21 | 0.3% |
| m | 20 | 0.3% |
| r | 14 | 0.2% |
| n | 13 | 0.2% |
| Other values (10) | 59 | 0.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 68 | |
| 4 | 65 | |
| 5 | 64 | |
| 1 | 24 | 9.6% |
| 6 | 13 | 5.2% |
| 2 | 11 | 4.4% |
| 7 | 2 | 0.8% |
| 9 | 2 | 0.8% |
| 8 | 1 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 14734 | |
| . | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 15288 | |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 867 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 867 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 277 | |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 74 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 116002 | |
| Common | 32359 | 21.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 49583 | |
| M | 37913 | |
| F | 10436 | 9.0% |
| C | 7657 | 6.6% |
| s | 7592 | 6.5% |
| P | 523 | 0.5% |
| B | 483 | 0.4% |
| E | 464 | 0.4% |
| H | 177 | 0.2% |
| D | 128 | 0.1% |
| Other values (32) | 1046 | 0.9% |
Common
| Value | Count | Frequency (%) |
| (space) | 15288 | |
| ; | 14734 | |
| ( | 867 | 2.7% |
| ) | 867 | 2.7% |
| \| | 277 | 0.9% |
| - | 74 | 0.2% |
| 3 | 68 | 0.2% |
| 4 | 65 | 0.2% |
| 5 | 64 | 0.2% |
| 1 | 24 | 0.1% |
| Other values (6) | 31 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 148361 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 49583 | |
| M | 37913 | |
| (space) | 15288 | 10.3% |
| ; | 14734 | 9.9% |
| F | 10436 | 7.0% |
| C | 7657 | 5.2% |
| s | 7592 | 5.1% |
| ( | 867 | 0.6% |
| ) | 867 | 0.6% |
| P | 523 | 0.4% |
| Other values (48) | 2901 | 2.0% |
Perovskite_composition_a_ions_coefficients
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 565 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 59 |
| Missing (%) | 0.1% |
| Memory size | 332.1 KiB |
| 1 | |
|---|---|
| 0.05; 0.79; 0.16 | 1555 |
| 0.85; 0.15 | 1472 |
| 0.83; 0.17 | 556 |
| 0.05; 0.81; 0.14 | 474 |
| Other values (560) |
Length
| Max length | 29 |
|---|---|
| Median length | 1 |
| Mean length | 3.6501956 |
| Min length | 1 |
Characters and Unicode
| Total characters | 154907 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 190 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 31551 | |
| 0.05; 0.79; 0.16 | 1555 | 3.7% |
| 0.85; 0.15 | 1472 | 3.5% |
| 0.83; 0.17 | 556 | 1.3% |
| 0.05; 0.81; 0.14 | 474 | 1.1% |
| 0.1; 0.9 | 318 | 0.7% |
| 0.17; 0.83 | 281 | 0.7% |
| 0.2; 0.8 | 247 | 0.6% |
| 0.3; 0.7 | 227 | 0.5% |
| 0.15; 0.85 | 218 | 0.5% |
| Other values (555) | 5539 | 13.0% |
Words
| Value | Count | Frequency (%) |
| 1 | 32280 | |
| 0.05 | 3352 | 5.8% |
| 0.15 | 2390 | 4.1% |
| 0.85 | 1830 | 3.2% |
| 0.16 | 1782 | 3.1% |
| 0.79 | 1700 | 2.9% |
| 2 | 1034 | 1.8% |
| 0.1 | 995 | 1.7% |
| 0.83 | 964 | 1.7% |
| 0.17 | 924 | 1.6% |
| Other values (264) | 10406 | 18.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 40940 | |
| 0 | 27528 | |
| . | 23221 | |
| (space) | 15219 | 9.8% |
| ; | 14665 | 9.5% |
| 5 | 10242 | 6.6% |
| 8 | 5304 | 3.4% |
| 7 | 4734 | 3.1% |
| 9 | 3069 | 2.0% |
| 6 | 2942 | 1.9% |
| Other values (5) | 7043 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 101479 | |
| Other Punctuation | 37886 | 24.5% |
| Space Separator | 15219 | 9.8% |
| Math Symbol | 277 | 0.2% |
| Lowercase Letter | 46 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 40940 | |
| 0 | 27528 | |
| 5 | 10242 | 10.1% |
| 8 | 5304 | 5.2% |
| 7 | 4734 | 4.7% |
| 9 | 3069 | 3.0% |
| 6 | 2942 | 2.9% |
| 2 | 2663 | 2.6% |
| 3 | 2434 | 2.4% |
| 4 | 1623 | 1.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 23221 | |
| ; | 14665 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 15219 | |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 277 | |
Lowercase Letter
| Value | Count | Frequency (%) |
| x | 46 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 154861 | |
| Latin | 46 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 40940 | |
| 0 | 27528 | |
| . | 23221 | |
| (space) | 15219 | 9.8% |
| ; | 14665 | 9.5% |
| 5 | 10242 | 6.6% |
| 8 | 5304 | 3.4% |
| 7 | 4734 | 3.1% |
| 9 | 3069 | 2.0% |
| 6 | 2942 | 1.9% |
| Other values (4) | 6997 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| x | 46 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 154907 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 40940 | |
| 0 | 27528 | |
| . | 23221 | |
| (space) | 15219 | 9.8% |
| ; | 14665 | 9.5% |
| 5 | 10242 | 6.6% |
| 8 | 5304 | 3.4% |
| 7 | 4734 | 3.1% |
| 9 | 3069 | 2.0% |
| 6 | 2942 | 1.9% |
| Other values (5) | 7043 | 4.5% |
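Perovskite_composition_a_ions and Perovskite_composition_a_ions_coefficients both hold `; `-separated lists (for example `Cs; FA; MA` and `0.05; 0.79; 0.16`). A minimal sketch of combining the two, assuming the coefficients are listed in the same order as the ions, which this report does not itself confirm. `pair_ions_with_coefficients` is a hypothetical helper; it handles single-layer strings only and maps non-numeric coefficients (the character tables show a few `x` entries) to `None`:

```python
def pair_ions_with_coefficients(ions, coefficients):
    """Map each ion to its coefficient, e.g. {'Cs': 0.05, 'FA': 0.79, ...}.

    Assumptions: ';' separates entries, and the two columns are
    positionally aligned. Strings containing '|' (multi-layer entries)
    are rejected rather than guessed at.
    """
    if "|" in ions or "|" in coefficients:
        raise ValueError("multi-layer entries are not handled in this sketch")
    names = [part.strip() for part in ions.split(";")]
    raw_coeffs = [part.strip() for part in coefficients.split(";")]
    if len(names) != len(raw_coeffs):
        raise ValueError("ion and coefficient counts differ")
    paired = {}
    for name, raw in zip(names, raw_coeffs):
        try:
            paired[name] = float(raw)
        except ValueError:
            paired[name] = None  # e.g. an unspecified 'x'
    return paired


print(pair_ions_with_coefficients("Cs; FA; MA", "0.05; 0.79; 0.16"))
```

Raising on a length mismatch instead of truncating makes misaligned rows visible early, which seems preferable for a dataset with this many free-text variants.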
Perovskite_composition_b_ions
Categorical
HIGH CARDINALITY  HIGH CORRELATION  IMBALANCE 
| Distinct | 53 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 332.1 KiB |
| Pb | |
|---|---|
| Sn | 636 |
| Pb; Sn | 499 |
| Bi | 263 |
| Pb | Pb | 246 |
| Other values (48) | 433 |
Length
| Max length | 22 |
|---|---|
| Median length | 2 |
| Mean length | 2.1077615 |
| Min length | 1 |
Characters and Unicode
| Total characters | 89563 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 11 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Sn |
|---|---|
| 2nd row | Sn |
| 3rd row | Sn |
| 4th row | Sn |
| 5th row | Sn |
Common Values
| Value | Count | Frequency (%) |
| Pb | 40415 | |
| Sn | 636 | 1.5% |
| Pb; Sn | 499 | 1.2% |
| Bi | 263 | 0.6% |
| Pb \| Pb | 246 | 0.6% |
| Sb | 84 | 0.2% |
| Ag; Pb | 35 | 0.1% |
| Pb; Sb | 34 | 0.1% |
| Pb; Sr | 20 | < 0.1% |
| Ba; Pb | 19 | < 0.1% |
| Other values (43) | 241 | 0.6% |
Words
| Value | Count | Frequency (%) |
| pb | 41696 | |
| sn | 1158 | 2.6% |
| bi | 305 | 0.7% |
| (separator) | 277 | 0.6% |
| sb | 131 | 0.3% |
| ag | 44 | 0.1% |
| cu | 28 | 0.1% |
| ge | 21 | < 0.1% |
| ba | 20 | < 0.1% |
| sr | 20 | < 0.1% |
| Other values (20) | 145 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| b | 41835 | |
| P | 41696 | |
| (space) | 1353 | 1.5% |
| S | 1310 | 1.5% |
| n | 1187 | 1.3% |
| ; | 799 | 0.9% |
| B | 325 | 0.4% |
| i | 320 | 0.4% |
| \| | 277 | 0.3% |
| g | 58 | 0.1% |
| Other values (20) | 403 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 43568 | |
| Lowercase Letter | 43566 | |
| Space Separator | 1353 | 1.5% |
| Other Punctuation | 799 | 0.9% |
| Math Symbol | 277 | 0.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 41696 | |
| S | 1310 | 3.0% |
| B | 325 | 0.7% |
| C | 54 | 0.1% |
| A | 46 | 0.1% |
| G | 21 | < 0.1% |
| F | 19 | < 0.1% |
| T | 18 | < 0.1% |
| N | 16 | < 0.1% |
| M | 16 | < 0.1% |
| Other values (6) | 47 | 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| b | 41835 | |
| n | 1187 | 2.7% |
| i | 320 | 0.7% |
| g | 58 | 0.1% |
| e | 51 | 0.1% |
| u | 41 | 0.1% |
| a | 36 | 0.1% |
| r | 21 | < 0.1% |
| o | 15 | < 0.1% |
| m | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1353 | |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 799 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 277 | |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 87134 | |
| Common | 2429 | 2.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| b | 41835 | |
| P | 41696 | |
| S | 1310 | 1.5% |
| n | 1187 | 1.4% |
| B | 325 | 0.4% |
| i | 320 | 0.4% |
| g | 58 | 0.1% |
| C | 54 | 0.1% |
| e | 51 | 0.1% |
| A | 46 | 0.1% |
| Other values (17) | 252 | 0.3% |
Common
| Value | Count | Frequency (%) |
| (space) | 1353 | |
| ; | 799 | |
| \| | 277 | 11.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 89563 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| b | 41835 | |
| P | 41696 | |
| (space) | 1353 | 1.5% |
| S | 1310 | 1.5% |
| n | 1187 | 1.3% |
| ; | 799 | 0.9% |
| B | 325 | 0.4% |
| i | 320 | 0.4% |
| \| | 277 | 0.3% |
| g | 58 | 0.1% |
| Other values (20) | 403 | 0.4% |
Perovskite_composition_b_ions_coefficients
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 148 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 34 |
| Missing (%) | 0.1% |
| Memory size | 332.1 KiB |
| 1 | |
|---|---|
| 1.0 | 462 |
| 4 | 317 |
| 2 | 247 |
| 1 | 1 | 230 |
| Other values (143) | 1381 |
Length
| Max length | 19 |
|---|---|
| Median length | 1 |
| Mean length | 1.2025764 |
| Min length | 1 |
Characters and Unicode
| Total characters | 51065 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 51 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 39826 | |
| 1.0 | 462 | 1.1% |
| 4 | 317 | 0.7% |
| 2 | 247 | 0.6% |
| 1 \| 1 | 230 | 0.5% |
| 3 | 226 | 0.5% |
| 0.5; 0.5 | 175 | 0.4% |
| 5 | 128 | 0.3% |
| 0.75; 0.25 | 73 | 0.2% |
| 0.4; 0.6 | 53 | 0.1% |
| Other values (138) | 726 | 1.7% |
Words
| Value | Count | Frequency (%) |
| 1 | 40411 | |
| 1.0 | 462 | 1.1% |
| 0.5 | 351 | 0.8% |
| 4 | 318 | 0.7% |
| (separator) | 277 | 0.6% |
| 2 | 256 | 0.6% |
| 3 | 244 | 0.6% |
| 5 | 128 | 0.3% |
| 0.25 | 107 | 0.2% |
| 0.75 | 105 | 0.2% |
| Other values (130) | 1158 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 41078 | |
| 0 | 2321 | 4.5% |
| . | 2068 | 4.0% |
| (space) | 1354 | 2.7% |
| 5 | 933 | 1.8% |
| ; | 800 | 1.6% |
| 2 | 474 | 0.9% |
| 4 | 455 | 0.9% |
| 9 | 432 | 0.8% |
| 3 | 344 | 0.7% |
| Other values (5) | 806 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 46565 | |
| Other Punctuation | 2868 | 5.6% |
| Space Separator | 1354 | 2.7% |
| Math Symbol | 277 | 0.5% |
| Lowercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 41078 | |
| 0 | 2321 | 5.0% |
| 5 | 933 | 2.0% |
| 2 | 474 | 1.0% |
| 4 | 455 | 1.0% |
| 9 | 432 | 0.9% |
| 3 | 344 | 0.7% |
| 7 | 241 | 0.5% |
| 6 | 163 | 0.4% |
| 8 | 124 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2068 | |
| ; | 800 | 27.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1354 | |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 277 | |
Lowercase Letter
| Value | Count | Frequency (%) |
| x | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 51064 | |
| Latin | 1 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 41078 | |
| 0 | 2321 | 4.5% |
| . | 2068 | 4.0% |
| (space) | 1354 | 2.7% |
| 5 | 933 | 1.8% |
| ; | 800 | 1.6% |
| 2 | 474 | 0.9% |
| 4 | 455 | 0.9% |
| 9 | 432 | 0.8% |
| 3 | 344 | 0.7% |
| Other values (4) | 805 | 1.6% |
Latin
| Value | Count | Frequency (%) |
| x | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51065 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 41078 | |
| 0 | 2321 | 4.5% |
| . | 2068 | 4.0% |
| (space) | 1354 | 2.7% |
| 5 | 933 | 1.8% |
| ; | 800 | 1.6% |
| 2 | 474 | 0.9% |
| 4 | 455 | 0.9% |
| 9 | 432 | 0.8% |
| 3 | 344 | 0.7% |
| Other values (5) | 806 | 1.6% |
Perovskite_composition_c_ions
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Memory size | 332.1 KiB |
| I | 31966 |
|---|---|
| Br; I | 9136 |
| Br | 981 |
| Br; I \| I | 87 |
| I \| I | 86 |
| Other values (28) | 234 |
Length
| Max length | 29 |
|---|---|
| Median length | 1 |
| Mean length | 1.9401506 |
| Min length | 1 |
Characters and Unicode
| Total characters | 82437 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | I |
|---|---|
| 2nd row | Br; I |
| 3rd row | Br; I |
| 4th row | Br; I |
| 5th row | Br |
Common Values
| Value | Count | Frequency (%) |
| I | 31966 | 75.2% |
| Br; I | 9136 | 21.5% |
| Br | 981 | 2.3% |
| Br; I \| I | 87 | 0.2% |
| I \| I | 86 | 0.2% |
| Cl | 35 | 0.1% |
| Br; I \| Br; I | 33 | 0.1% |
| O | 32 | 0.1% |
| Cl; I | 21 | < 0.1% |
| Br; Cl; I | 11 | < 0.1% |
| Other values (23) | 102 | 0.2% |
Words
| Value | Count | Frequency (%) |
| i | 41672 | |
| br | 10358 | 19.7% |
| (empty) | 277 | 0.5% |
| cl | 80 | 0.2% |
| o | 32 | 0.1% |
| scn | 13 | < 0.1% |
| pf6 | 12 | < 0.1% |
| f | 12 | < 0.1% |
| bf4 | 9 | < 0.1% |
| s | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 41672 | |
| B | 10367 | 12.6% |
| r | 10358 | 12.6% |
| (space) | 9977 | 12.1% |
| ; | 9423 | 11.4% |
| \| | 277 | 0.3% |
| C | 93 | 0.1% |
| l | 80 | 0.1% |
| F | 33 | < 0.1% |
| ) | 32 | < 0.1% |
| Other values (7) | 125 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 52237 | |
| Lowercase Letter | 10438 | 12.7% |
| Space Separator | 9977 | 12.1% |
| Other Punctuation | 9423 | 11.4% |
| Math Symbol | 277 | 0.3% |
| Close Punctuation | 32 | < 0.1% |
| Open Punctuation | 32 | < 0.1% |
| Decimal Number | 21 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 41672 | |
| B | 10367 | 19.8% |
| C | 93 | 0.2% |
| F | 33 | 0.1% |
| O | 32 | 0.1% |
| S | 15 | < 0.1% |
| N | 13 | < 0.1% |
| P | 12 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 10358 | |
| l | 80 | 0.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 12 | |
| 4 | 9 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9977 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 9423 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 277 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 32 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 32 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 62675 | |
| Common | 19762 | 24.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 41672 | |
| B | 10367 | 16.5% |
| r | 10358 | 16.5% |
| C | 93 | 0.1% |
| l | 80 | 0.1% |
| F | 33 | 0.1% |
| O | 32 | 0.1% |
| S | 15 | < 0.1% |
| N | 13 | < 0.1% |
| P | 12 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 9977 | |
| ; | 9423 | |
| \| | 277 | 1.4% |
| ) | 32 | 0.2% |
| ( | 32 | 0.2% |
| 6 | 12 | 0.1% |
| 4 | 9 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 82437 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 41672 | |
| B | 10367 | 12.6% |
| r | 10358 | 12.6% |
| (space) | 9977 | 12.1% |
| ; | 9423 | 11.4% |
| \| | 277 | 0.3% |
| C | 93 | 0.1% |
| l | 80 | 0.1% |
| F | 33 | < 0.1% |
| ) | 32 | < 0.1% |
| Other values (7) | 125 | 0.2% |
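The values profiled above pack several ions into a single string, which is why entries such as `Br; I | I` show up as separate categories. A minimal sketch of unpacking them, assuming (the report itself does not state this) that `|` separates stacked perovskite layers and `;` separates the ions within one layer. `parse_ions` is a hypothetical helper name, not part of any library.

```python
def parse_ions(value):
    """Split a packed ion string into layers, each a list of ion labels.

    Assumption: '|' separates layers and ';' separates ions in a layer.
    """
    layers = []
    for layer in value.split("|"):
        # Drop surrounding whitespace and skip empty fragments.
        ions = [ion.strip() for ion in layer.split(";") if ion.strip()]
        layers.append(ions)
    return layers


print(parse_ions("I"))          # [['I']]
print(parse_ions("Br; I | I"))  # [['Br', 'I'], ['I']]
```

Parsing into nested lists first makes it possible to count ions per layer instead of treating every distinct string as its own category.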
Perovskite_composition_c_ions_coefficients
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 370 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 41 |
| Missing (%) | 0.1% |
| Memory size | 332.1 KiB |
| 3 | 31712 |
|---|---|
| 0.45; 2.55 | 2269 |
| 0.51; 2.49 | 1858 |
| 1; 2 | 737 |
| 0.3; 2.7 | 363 |
| Other values (365) | 5517 |
Length
| Max length | 34 |
|---|---|
| Median length | 1 |
| Mean length | 2.7958592 |
| Min length | 1 |
Characters and Unicode
| Total characters | 118701 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 141 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 0.3; 2.7 |
| 3rd row | 1.5; 1.5 |
| 4th row | 2.7; 0.3 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 31712 | 74.6% |
| 0.45; 2.55 | 2269 | 5.3% |
| 0.51; 2.49 | 1858 | 4.4% |
| 1; 2 | 737 | 1.7% |
| 0.3; 2.7 | 363 | 0.9% |
| 2; 1 | 327 | 0.8% |
| 13 | 317 | 0.7% |
| 0.5; 2.5 | 285 | 0.7% |
| 0.15; 2.85 | 223 | 0.5% |
| 9 | 218 | 0.5% |
| Other values (360) | 4147 | 9.8% |
Words
| Value | Count | Frequency (%) |
| 3 | 31942 | |
| 0.45 | 2369 | 4.5% |
| 2.55 | 2338 | 4.5% |
| 0.51 | 1945 | 3.7% |
| 2.49 | 1915 | 3.7% |
| 1 | 1226 | 2.3% |
| 2 | 1131 | 2.2% |
| 0.3 | 434 | 0.8% |
| 2.7 | 411 | 0.8% |
| 13 | 317 | 0.6% |
| Other values (331) | 8342 | 15.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 33736 | |
| . | 16329 | |
| 5 | 12066 | 10.2% |
| (space) | 9915 | 8.4% |
| 2 | 9857 | 8.3% |
| ; | 9361 | 7.9% |
| 0 | 8808 | 7.4% |
| 4 | 5756 | 4.8% |
| 1 | 5645 | 4.8% |
| 9 | 3225 | 2.7% |
| Other values (5) | 4003 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 82743 | |
| Other Punctuation | 25690 | 21.6% |
| Space Separator | 9915 | 8.4% |
| Math Symbol | 277 | 0.2% |
| Lowercase Letter | 76 | 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 33736 | |
| 5 | 12066 | 14.6% |
| 2 | 9857 | 11.9% |
| 0 | 8808 | 10.6% |
| 4 | 5756 | 7.0% |
| 1 | 5645 | 6.8% |
| 9 | 3225 | 3.9% |
| 6 | 1280 | 1.5% |
| 7 | 1279 | 1.5% |
| 8 | 1091 | 1.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 16329 | |
| ; | 9361 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9915 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 277 |
Lowercase Letter
| Value | Count | Frequency (%) |
| x | 76 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 118625 | |
| Latin | 76 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 33736 | |
| . | 16329 | |
| 5 | 12066 | 10.2% |
| (space) | 9915 | 8.4% |
| 2 | 9857 | 8.3% |
| ; | 9361 | 7.9% |
| 0 | 8808 | 7.4% |
| 4 | 5756 | 4.9% |
| 1 | 5645 | 4.8% |
| 9 | 3225 | 2.7% |
| Other values (4) | 3927 | 3.3% |
Latin
| Value | Count | Frequency (%) |
| x | 76 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 118701 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 33736 | |
| . | 16329 | |
| 5 | 12066 | 10.2% |
| (space) | 9915 | 8.4% |
| 2 | 9857 | 8.3% |
| ; | 9361 | 7.9% |
| 0 | 8808 | 7.4% |
| 4 | 5756 | 4.8% |
| 1 | 5645 | 4.8% |
| 9 | 3225 | 2.7% |
| Other values (5) | 4003 | 3.4% |
Perovskite_composition_none_stoichiometry_components_in_excess
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 45 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 35521 |
| Missing (%) | 83.6% |
| Memory size | 332.1 KiB |
| PbI2 | 3212 |
|---|---|
| Stoichiometric | 1390 |
| MAI | 909 |
| MA | 663 |
| PbI2; PbBr2 | 212 |
| Other values (40) | 590 |
Length
| Max length | 31 |
|---|---|
| Median length | 20 |
| Mean length | 5.9568521 |
| Min length | 1 |
Characters and Unicode
| Total characters | 41555 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | MA |
|---|---|
| 2nd row | HCl |
| 3rd row | HCl |
| 4th row | PbI2 |
| 5th row | PbI2 |
Common Values
| Value | Count | Frequency (%) |
| PbI2 | 3212 | 7.6% |
| Stoichiometric | 1390 | 3.3% |
| MAI | 909 | 2.1% |
| MA | 663 | 1.6% |
| PbI2; PbBr2 | 212 | 0.5% |
| Pb | 87 | 0.2% |
| PbCl2 | 78 | 0.2% |
| MACl | 65 | 0.2% |
| PbBr2 | 43 | 0.1% |
| CsBr | 38 | 0.1% |
| Other values (35) | 279 | 0.7% |
| (Missing) | 35521 | 83.6% |
Words
| Value | Count | Frequency (%) |
| pbi2 | 3460 | |
| stoichiometric | 1405 | |
| mai | 922 | 12.6% |
| ma | 663 | 9.1% |
| pbbr2 | 273 | 3.7% |
| pb | 87 | 1.2% |
| pbcl2 | 78 | 1.1% |
| macl | 66 | 0.9% |
| fai | 48 | 0.7% |
| sni2 | 38 | 0.5% |
| Other values (27) | 252 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 4548 | |
| i | 4224 | |
| P | 3917 | |
| b | 3912 | |
| 2 | 3885 | |
| c | 2816 | 6.8% |
| t | 2810 | 6.8% |
| o | 2810 | 6.8% |
| r | 1762 | 4.2% |
| A | 1756 | 4.2% |
| Other values (26) | 9115 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22919 | |
| Uppercase Letter | 14106 | |
| Decimal Number | 3916 | 9.4% |
| Space Separator | 316 | 0.8% |
| Other Punctuation | 278 | 0.7% |
| Math Symbol | 19 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 4548 | |
| P | 3917 | |
| A | 1756 | 12.4% |
| M | 1677 | 11.9% |
| S | 1483 | 10.5% |
| B | 359 | 2.5% |
| C | 241 | 1.7% |
| F | 70 | 0.5% |
| H | 19 | 0.1% |
| N | 17 | 0.1% |
| Other values (4) | 19 | 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 4224 | |
| b | 3912 | |
| c | 2816 | |
| t | 2810 | |
| o | 2810 | |
| r | 1762 | |
| h | 1405 | 6.1% |
| e | 1405 | 6.1% |
| m | 1405 | 6.1% |
| l | 165 | 0.7% |
| Other values (4) | 205 | 0.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3885 | |
| 4 | 15 | 0.4% |
| 3 | 15 | 0.4% |
| 5 | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 316 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 278 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 19 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37025 | |
| Common | 4530 | 10.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 4548 | |
| i | 4224 | |
| P | 3917 | |
| b | 3912 | |
| c | 2816 | |
| t | 2810 | |
| o | 2810 | |
| r | 1762 | 4.8% |
| A | 1756 | 4.7% |
| M | 1677 | 4.5% |
| Other values (18) | 6793 |
Common
| Value | Count | Frequency (%) |
| 2 | 3885 | |
| (space) | 316 | 7.0% |
| ; | 278 | 6.1% |
| \| | 19 | 0.4% |
| 4 | 15 | 0.3% |
| 3 | 15 | 0.3% |
| 5 | 1 | < 0.1% |
| - | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41555 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 4548 | |
| i | 4224 | |
| P | 3917 | |
| b | 3912 | |
| 2 | 3885 | |
| c | 2816 | 6.8% |
| t | 2810 | 6.8% |
| o | 2810 | 6.8% |
| r | 1762 | 4.2% |
| A | 1756 | 4.2% |
| Other values (26) | 9115 |
Perovskite_additives_compounds
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 979 |
|---|---|
| Distinct (%) | 7.1% |
| Missing | 28675 |
| Missing (%) | 67.5% |
| Memory size | 332.1 KiB |
| Cl | 5182 |
|---|---|
| Undoped | 1313 |
| Unknown | 558 |
| 5-AVAI | 349 |
| SnF2 | 240 |
| Other values (974) | 6180 |
Length
| Max length | 142 |
|---|---|
| Median length | 90 |
| Mean length | 5.6151064 |
| Min length | 1 |
Characters and Unicode
| Total characters | 77612 |
|---|---|
| Distinct characters | 84 |
| Distinct categories | 11 |
| Distinct scripts | 4 |
| Distinct blocks | 5 |
Unique
| Unique | 284 |
|---|---|
| Unique (%) | 2.1% |
Sample
| 1st row | SnF2 |
|---|---|
| 2nd row | SnF2 |
| 3rd row | SnF2 |
| 4th row | SnF2 |
| 5th row | Cl |
Common Values
| Value | Count | Frequency (%) |
| Cl | 5182 | 12.2% |
| Undoped | 1313 | 3.1% |
| Unknown | 558 | 1.3% |
| 5-AVAI | 349 | 0.8% |
| SnF2 | 240 | 0.6% |
| HI | 210 | 0.5% |
| Rb | 175 | 0.4% |
| Pb(SCN)2 | 174 | 0.4% |
| Acetate | 119 | 0.3% |
| KI | 105 | 0.2% |
| Other values (969) | 5397 | 12.7% |
| (Missing) | 28675 | 67.5% |
Words
| Value | Count | Frequency (%) |
| cl | 5946 | |
| undoped | 1341 | 8.4% |
| unknown | 558 | 3.5% |
| snf2 | 428 | 2.7% |
| 5-avai | 367 | 2.3% |
| hi | 304 | 1.9% |
| acetate | 285 | 1.8% |
| pb(scn)2 | 230 | 1.4% |
| rb | 205 | 1.3% |
| pbcl2 | 133 | 0.8% |
| Other values (866) | 6114 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 7991 | 10.3% |
| l | 7773 | 10.0% |
| n | 5026 | 6.5% |
| d | 3690 | 4.8% |
| o | 3622 | 4.7% |
| e | 3612 | 4.7% |
| A | 2483 | 3.2% |
| i | 2262 | 2.9% |
| (space) | 2088 | 2.7% |
| p | 2080 | 2.7% |
| Other values (74) | 36985 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 40836 | |
| Uppercase Letter | 26259 | |
| Decimal Number | 3815 | 4.9% |
| Space Separator | 2088 | 2.7% |
| Dash Punctuation | 1782 | 2.3% |
| Other Punctuation | 1652 | 2.1% |
| Close Punctuation | 550 | 0.7% |
| Open Punctuation | 550 | 0.7% |
| Math Symbol | 61 | 0.1% |
| Format | 18 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 7991 | |
| A | 2483 | 9.5% |
| U | 1963 | 7.5% |
| P | 1920 | 7.3% |
| I | 1826 | 7.0% |
| H | 1411 | 5.4% |
| S | 1236 | 4.7% |
| N | 1102 | 4.2% |
| F | 801 | 3.1% |
| M | 740 | 2.8% |
| Other values (16) | 4786 |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 7773 | |
| n | 5026 | |
| d | 3690 | |
| o | 3622 | |
| e | 3612 | |
| i | 2262 | 5.5% |
| p | 2080 | 5.1% |
| a | 1859 | 4.6% |
| t | 1376 | 3.4% |
| r | 1372 | 3.4% |
| Other values (16) | 8164 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1771 | |
| 3 | 472 | 12.4% |
| 4 | 431 | 11.3% |
| 5 | 423 | 11.1% |
| 1 | 260 | 6.8% |
| 0 | 179 | 4.7% |
| 6 | 168 | 4.4% |
| 8 | 49 | 1.3% |
| 7 | 41 | 1.1% |
| 9 | 17 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 1420 | |
| , | 142 | 8.6% |
| : | 39 | 2.4% |
| @ | 29 | 1.8% |
| . | 13 | 0.8% |
| / | 8 | 0.5% |
| ′ | 1 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1549 | |
| ‐ | 221 | 12.4% |
| ‑ | 7 | 0.4% |
| – | 5 | 0.3% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 55 | |
| ∙ | 4 | 6.6% |
| + | 2 | 3.3% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 523 | |
| ] | 27 | 4.9% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 523 | |
| [ | 27 | 4.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2088 |
Format
| Value | Count | Frequency (%) |
| | 18 |
Control
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 67084 | |
| Common | 10513 | 13.5% |
| Greek | 11 | < 0.1% |
| Arabic | 4 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 7991 | 11.9% |
| l | 7773 | 11.6% |
| n | 5026 | 7.5% |
| d | 3690 | 5.5% |
| o | 3622 | 5.4% |
| e | 3612 | 5.4% |
| A | 2483 | 3.7% |
| i | 2262 | 3.4% |
| p | 2080 | 3.1% |
| U | 1963 | 2.9% |
| Other values (41) | 26582 |
Common
| Value | Count | Frequency (%) |
| (space) | 2088 | |
| 2 | 1771 | |
| - | 1549 | |
| ; | 1420 | |
| ) | 523 | 5.0% |
| ( | 523 | 5.0% |
| 3 | 472 | 4.5% |
| 4 | 431 | 4.1% |
| 5 | 423 | 4.0% |
| 1 | 260 | 2.5% |
| Other values (21) | 1053 |
Greek
| Value | Count | Frequency (%) |
| α | 11 |
Arabic
| Value | Count | Frequency (%) |
| ۰ | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 77341 | |
| Punctuation | 234 | 0.3% |
| None | 29 | < 0.1% |
| Arabic | 4 | < 0.1% |
| Math Operators | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 7991 | 10.3% |
| l | 7773 | 10.1% |
| n | 5026 | 6.5% |
| d | 3690 | 4.8% |
| o | 3622 | 4.7% |
| e | 3612 | 4.7% |
| A | 2483 | 3.2% |
| i | 2262 | 2.9% |
| (space) | 2088 | 2.7% |
| p | 2080 | 2.7% |
| Other values (66) | 36714 |
Punctuation
| Value | Count | Frequency (%) |
| ‐ | 221 | |
| ‑ | 7 | 3.0% |
| – | 5 | 2.1% |
| ′ | 1 | 0.4% |
None
| Value | Count | Frequency (%) |
| | 18 | |
| α | 11 |
Arabic
| Value | Count | Frequency (%) |
| ۰ | 4 |
Math Operators
| Value | Count | Frequency (%) |
| ∙ | 4 |
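The character tables above list several hyphen variants under Dash Punctuation alongside Format and Control characters, so one additive may be spelled in more than one way and counted as separate categories. A minimal cleaning sketch using only the standard library follows. The list of code points to fold (U+2010, U+2011, U+2013) is an assumption based on the dash rows shown, and `normalize_label` is a hypothetical helper name.

```python
import unicodedata

# Hyphen-like code points folded to ASCII '-'.
# Assumed list: extend it to match the characters actually present.
DASHES = {0x2010: "-", 0x2011: "-", 0x2013: "-"}


def normalize_label(text):
    """Fold hyphen variants to '-' and drop format/control characters."""
    text = text.translate(DASHES)
    # 'Cf' = Format (e.g. zero-width characters), 'Cc' = Control.
    return "".join(
        ch for ch in text if unicodedata.category(ch) not in ("Cf", "Cc")
    )


print(normalize_label("5\u2010AVAI") == "5-AVAI")  # True
```

Applying such a normalization before profiling would likely lower the distinct count, though by how much cannot be read from this report.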
Perovskite_additives_concentrations
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 706 |
|---|---|
| Distinct (%) | 14.4% |
| Missing | 37606 |
| Missing (%) | 88.5% |
| Memory size | 332.1 KiB |
| 0.05 | 395 |
|---|---|
| 0.1 | 371 |
| 0.01 | 238 |
| 0.03 | 171 |
| 0.02 | 158 |
| Other values (701) | 3558 |
Length
| Max length | 31 |
|---|---|
| Median length | 28 |
| Mean length | 5.73257 |
| Min length | 1 |
Characters and Unicode
| Total characters | 28038 |
|---|---|
| Distinct characters | 45 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 300 |
|---|---|
| Unique (%) | 6.1% |
Sample
| 1st row | 0.2 |
|---|---|
| 2nd row | 0.2 |
| 3rd row | 0.2 |
| 4th row | 0.2 |
| 5th row | 0.25 |
Common Values
| Value | Count | Frequency (%) |
| 0.05 | 395 | 0.9% |
| 0.1 | 371 | 0.9% |
| 0.01 | 238 | 0.6% |
| 0.03 | 171 | 0.4% |
| 0.02 | 158 | 0.4% |
| 0.66 | 110 | 0.3% |
| 0.15 | 103 | 0.2% |
| 0.2 | 94 | 0.2% |
| 0.25 | 79 | 0.2% |
| 0.5 | 76 | 0.2% |
| Other values (696) | 3096 | 7.3% |
| (Missing) | 37606 | 88.5% |
Words
| Value | Count | Frequency (%) |
| 0.05 | 499 | 6.9% |
| 0.1 | 487 | 6.7% |
| mol | 455 | 6.3% |
| nan | 367 | 5.1% |
| 0.01 | 321 | 4.4% |
| mg/ml | 294 | 4.0% |
| wt | 293 | 4.0% |
| (empty) | 278 | 3.8% |
| 0.03 | 195 | 2.7% |
| 1 | 188 | 2.6% |
| Other values (278) | 3887 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6879 | |
| . | 4146 | |
| (space) | 2388 | 8.5% |
| 1 | 1927 | 6.9% |
| 5 | 1755 | 6.3% |
| m | 1161 | 4.1% |
| % | 1139 | 4.1% |
| 2 | 1009 | 3.6% |
| l | 946 | 3.4% |
| 3 | 841 | 3.0% |
| Other values (35) | 5847 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14084 | |
| Other Punctuation | 6305 | |
| Lowercase Letter | 5003 | 17.8% |
| Space Separator | 2388 | 8.5% |
| Uppercase Letter | 200 | 0.7% |
| Math Symbol | 35 | 0.1% |
| Dash Punctuation | 17 | 0.1% |
| Connector Punctuation | 5 | < 0.1% |
| Format | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 1161 | |
| l | 946 | |
| n | 742 | |
| o | 625 | |
| a | 371 | 7.4% |
| g | 313 | 6.3% |
| t | 309 | 6.2% |
| w | 299 | 6.0% |
| v | 146 | 2.9% |
| e | 21 | 0.4% |
| Other values (7) | 70 | 1.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6879 | |
| 1 | 1927 | 13.7% |
| 5 | 1755 | 12.5% |
| 2 | 1009 | 7.2% |
| 3 | 841 | 6.0% |
| 6 | 667 | 4.7% |
| 7 | 313 | 2.2% |
| 4 | 293 | 2.1% |
| 8 | 216 | 1.5% |
| 9 | 184 | 1.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 108 | |
| A | 16 | 8.0% |
| G | 15 | 7.5% |
| E | 15 | 7.5% |
| P | 15 | 7.5% |
| L | 14 | 7.0% |
| V | 8 | 4.0% |
| I | 8 | 4.0% |
| W | 1 | 0.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4146 | |
| % | 1139 | 18.1% |
| ; | 691 | 11.0% |
| / | 329 | 5.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2388 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 35 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 17 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5 |
Format
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 22846 | |
| Latin | 5192 | 18.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 1161 | |
| l | 946 | |
| n | 742 | |
| o | 625 | |
| a | 371 | 7.1% |
| g | 313 | 6.0% |
| t | 309 | 6.0% |
| w | 299 | 5.8% |
| v | 146 | 2.8% |
| M | 108 | 2.1% |
| Other values (15) | 172 | 3.3% |
Common
| Value | Count | Frequency (%) |
| 0 | 6879 | |
| . | 4146 | |
| (space) | 2388 | 10.5% |
| 1 | 1927 | 8.4% |
| 5 | 1755 | 7.7% |
| % | 1139 | 5.0% |
| 2 | 1009 | 4.4% |
| 3 | 841 | 3.7% |
| ; | 691 | 3.0% |
| 6 | 667 | 2.9% |
| Other values (10) | 1404 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28026 | |
| None | 11 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6879 | |
| . | 4146 | |
| (space) | 2388 | 8.5% |
| 1 | 1927 | 6.9% |
| 5 | 1755 | 6.3% |
| m | 1161 | 4.1% |
| % | 1139 | 4.1% |
| 2 | 1009 | 3.6% |
| l | 946 | 3.4% |
| 3 | 841 | 3.0% |
| Other values (33) | 5835 |
None
| Value | Count | Frequency (%) |
| µ | 11 |
Punctuation
| Value | Count | Frequency (%) |
| | 1 |
Perovskite_thickness
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 29124 |
|---|---|
| Missing (%) | 68.5% |
| Memory size | 332.1 KiB |
Perovskite_band_gap
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 10585 |
|---|---|
| Missing (%) | 24.9% |
| Memory size | 332.1 KiB |
Perovskite_band_gap_graded
Boolean
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.6 KiB |
| False | 42437 |
|---|---|
| True | 60 |
| Value | Count | Frequency (%) |
| False | 42437 | 99.9% |
| True | 60 | 0.1% |
Perovskite_pl_max
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 32254 |
|---|---|
| Missing (%) | 75.9% |
| Memory size | 332.1 KiB |
Perovskite_deposition_number_of_deposition_steps
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.2792197 |
| Minimum | 0 |
| Maximum | 12 |
| Zeros | 94 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 332.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 12 |
| Range | 12 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.51945352 |
|---|---|
| Coefficient of variation (CV) | 0.4060706 |
| Kurtosis | 41.145649 |
| Mean | 1.2792197 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.4095069 |
| Sum | 54363 |
| Variance | 0.26983196 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 31191 | 73.4% |
| 2 | 10660 | 25.1% |
| 3 | 475 | 1.1% |
| 0 | 94 | 0.2% |
| 4 | 53 | 0.1% |
| 10 | 12 | < 0.1% |
| 6 | 5 | < 0.1% |
| 12 | 4 | < 0.1% |
| 5 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 94 | 0.2% |
| 1 | 31191 | 73.4% |
| 2 | 10660 | 25.1% |
| 3 | 475 | 1.1% |
| 4 | 53 | 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 5 | < 0.1% |
| 7 | 1 | < 0.1% |
| 10 | 12 | < 0.1% |
| 12 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 12 | 4 | < 0.1% |
| 10 | 12 | < 0.1% |
| 7 | 1 | < 0.1% |
| 6 | 5 | < 0.1% |
| 5 | 2 | < 0.1% |
| 4 | 53 | 0.1% |
| 3 | 475 | 1.1% |
| 2 | 10660 | 25.1% |
| 1 | 31191 | 73.4% |
| 0 | 94 | 0.2% |
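The descriptive statistics for this variable can be cross-checked against its frequency table. The sketch below recomputes the sum, mean, variance, standard deviation and coefficient of variation from the ten value/count pairs listed above. It assumes the reported variance uses the sample convention (n - 1 in the denominator), which the report does not state explicitly.

```python
import math

# Value -> count, copied from the frequency table above.
counts = {0: 94, 1: 31191, 2: 10660, 3: 475, 4: 53,
          5: 2, 6: 5, 7: 1, 10: 12, 12: 4}

n = sum(counts.values())                        # number of observations
total = sum(v * c for v, c in counts.items())   # the "Sum" row
mean = total / n
# Sample variance: assumed n - 1 denominator.
variance = sum(c * (v - mean) ** 2 for v, c in counts.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean                                 # coefficient of variation

print(n, total)  # 42497 54363
print(round(mean, 7), round(variance, 6), round(cv, 4))
```

The observation count and sum match the report exactly, and the remaining figures should agree with the reported values to within rounding.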
Perovskite_deposition_procedure
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 198 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
| Spin-coating | 29100 |
|---|---|
| Spin-coating >> Spin-coating | 5471 |
| Spin-coating >> CBD | 3178 |
| Drop-infiltration | 632 |
| Spin-coating >> Gas reaction | 551 |
| Other values (193) | 3565 |
Length
| Max length | 156 |
|---|---|
| Median length | 12 |
| Mean length | 15.895146 |
| Min length | 3 |
Characters and Unicode
| Total characters | 675496 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 29 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Spin-coating |
|---|---|
| 2nd row | Spin-coating |
| 3rd row | Spin-coating |
| 4th row | Spin-coating |
| 5th row | Spin-coating |
Common Values
| Value | Count | Frequency (%) |
| Spin-coating | 29100 | 68.5% |
| Spin-coating >> Spin-coating | 5471 | 12.9% |
| Spin-coating >> CBD | 3178 | 7.5% |
| Drop-infiltration | 632 | 1.5% |
| Spin-coating >> Gas reaction | 551 | 1.3% |
| Co-evaporation | 398 | 0.9% |
| Doctor blading | 266 | 0.6% |
| Unknown | 147 | 0.3% |
| Slot-die coating | 143 | 0.3% |
| Evaporation >> Gas reaction | 134 | 0.3% |
| Other values (188) | 2477 | 5.8% |
Words
| Value | Count | Frequency (%) |
| spin-coating | 45545 | |
| (empty) | 11898 | 17.4% |
| cbd | 3657 | 5.3% |
| evaporation | 944 | 1.4% |
| reaction | 928 | 1.4% |
| gas | 872 | 1.3% |
| drop-infiltration | 772 | 1.1% |
| co-evaporation | 434 | 0.6% |
| doctor | 291 | 0.4% |
| blading | 291 | 0.4% |
| Other values (78) | 2807 | 4.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 98506 | |
| n | 97670 | |
| o | 53758 | |
| a | 53223 | |
| t | 51610 | |
| p | 49009 | |
| c | 48107 | |
| - | 47471 | |
| g | 47087 | |
| S | 46039 | |
| Other values (37) | 83016 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 516571 | |
| Uppercase Letter | 61835 | 9.2% |
| Dash Punctuation | 47471 | 7.0% |
| Space Separator | 25942 | 3.8% |
| Math Symbol | 23677 | 3.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 98506 | |
| n | 97670 | |
| o | 53758 | |
| a | 53223 | |
| t | 51610 | |
| p | 49009 | |
| c | 48107 | |
| g | 47087 | |
| r | 5573 | 1.1% |
| e | 2401 | 0.5% |
| Other values (15) | 9627 | 1.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 46039 | |
| D | 5212 | 8.4% |
| C | 4201 | 6.8% |
| B | 3671 | 5.9% |
| E | 1007 | 1.6% |
| G | 894 | 1.4% |
| U | 299 | 0.5% |
| R | 151 | 0.2% |
| I | 137 | 0.2% |
| A | 69 | 0.1% |
| Other values (8) | 155 | 0.3% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 23558 | |
| \| | 119 | 0.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 47471 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 25942 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 578406 | |
| Common | 97090 | 14.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 98506 | |
| n | 97670 | |
| o | 53758 | |
| a | 53223 | |
| t | 51610 | |
| p | 49009 | |
| c | 48107 | |
| g | 47087 | |
| S | 46039 | |
| r | 5573 | 1.0% |
| Other values (33) | 27824 | 4.8% |
Common
| Value | Count | Frequency (%) |
| - | 47471 | |
| (space) | 25942 | |
| > | 23558 | |
| \| | 119 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 675496 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 98506 | |
| n | 97670 | |
| o | 53758 | |
| a | 53223 | |
| t | 51610 | |
| p | 49009 | |
| c | 48107 | |
| - | 47471 | |
| g | 47087 | |
| S | 46039 | |
| Other values (37) | 83016 |
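Many of the categories above are chains such as `Spin-coating >> CBD`, which helps explain why the token `spin-coating` occurs more often than there are rows. A minimal sketch of splitting a procedure into its steps, assuming (an interpretation, not stated in the report) that `>>` joins sequential deposition steps. `split_steps` is a hypothetical helper name.

```python
def split_steps(procedure):
    """Return the individual steps of a '>>'-chained procedure string.

    Assumption: '>>' joins sequential deposition steps.
    """
    return [step.strip() for step in procedure.split(">>")]


print(split_steps("Spin-coating >> CBD"))  # ['Spin-coating', 'CBD']
print(len(split_steps("Spin-coating")))    # 1
```

Counting the steps this way gives a figure that could be compared with `Perovskite_deposition_number_of_deposition_steps`, although the report does not confirm that the two columns always agree.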
Perovskite_deposition_synthesis_atmosphere
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 186 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
| Unknown | 14843 |
|---|---|
| N2 | 14652 |
| Air | 3448 |
| N2 >> N2 | 2805 |
| Air >> Air | 1962 |
| Other values (181) | 4787 |
Length
| Max length | 56 |
|---|---|
| Median length | 52 |
| Mean length | 5.630068 |
| Min length | 2 |
Characters and Unicode
| Total characters | 239261 |
|---|---|
| Distinct characters | 52 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 37 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | N2 |
|---|---|
| 2nd row | N2 |
| 3rd row | N2 |
| 4th row | N2 |
| 5th row | N2 |
Common Values
| Value | Count | Frequency (%) |
| Unknown | 14843 | 34.9% |
| N2 | 14652 | 34.5% |
| Air | 3448 | 8.1% |
| N2 >> N2 | 2805 | 6.6% |
| Air >> Air | 1962 | 4.6% |
| Ar | 692 | 1.6% |
| Dry air | 651 | 1.5% |
| Vacuum | 465 | 1.1% |
| Ar >> Ar | 212 | 0.5% |
| Dry air >> Dry air | 204 | 0.5% |
| Other values (176) | 2563 | 6.0% |
Words
| Value | Count | Frequency (%) |
| n2 | 21666 | |
| unknown | 15265 | |
| air | 9996 | |
| (empty) | 8067 | 13.3% |
| vacuum | 1679 | 2.8% |
| ar | 1203 | 2.0% |
| dry | 1131 | 1.9% |
| mai | 582 | 1.0% |
| ambient | 428 | 0.7% |
| methylamin | 146 | 0.2% |
| Other values (40) | 462 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 46453 | |
| 2 | 21723 | |
| N | 21676 | |
| (space) | 18128 | 7.6% |
| > | 15930 | 6.7% |
| o | 15341 | 6.4% |
| U | 15265 | 6.4% |
| k | 15265 | 6.4% |
| w | 15265 | 6.4% |
| r | 12471 | 5.2% |
| Other values (42) | 41744 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 129273 | |
| Uppercase Letter | 53220 | |
| Decimal Number | 21747 | 9.1% |
| Space Separator | 18128 | 7.6% |
| Math Symbol | 16029 | 6.7% |
| Other Punctuation | 861 | 0.4% |
| Dash Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 46453 | |
| o | 15341 | 11.9% |
| k | 15265 | 11.8% |
| w | 15265 | 11.8% |
| r | 12471 | 9.6% |
| i | 10600 | 8.2% |
| u | 3362 | 2.6% |
| a | 2969 | 2.3% |
| m | 2261 | 1.7% |
| c | 1683 | 1.3% |
| Other values (11) | 3603 | 2.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 21676 | |
| U | 15265 | |
| A | 11352 | |
| V | 1679 | 3.2% |
| D | 1149 | 2.2% |
| M | 882 | 1.7% |
| I | 751 | 1.4% |
| F | 122 | 0.2% |
| B | 103 | 0.2% |
| C | 101 | 0.2% |
| Other values (8) | 140 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 21723 | |
| 4 | 12 | 0.1% |
| 5 | 6 | < 0.1% |
| 1 | 2 | < 0.1% |
| 7 | 2 | < 0.1% |
| 0 | 2 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 856 | |
| @ | 3 | 0.3% |
| , | 2 | 0.2% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 15930 | |
| \| | 99 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 18128 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 182493 | |
| Common | 56768 | 23.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 46453 | |
| N | 21676 | |
| o | 15341 | 8.4% |
| U | 15265 | 8.4% |
| k | 15265 | 8.4% |
| w | 15265 | 8.4% |
| r | 12471 | 6.8% |
| A | 11352 | 6.2% |
| i | 10600 | 5.8% |
| u | 3362 | 1.8% |
| Other values (29) | 15443 | 8.5% |
Common
| Value | Count | Frequency (%) |
| 2 | 21723 | |
| (space) | 18128 | |
| > | 15930 | |
| ; | 856 | 1.5% |
| \| | 99 | 0.2% |
| 4 | 12 | < 0.1% |
| 5 | 6 | < 0.1% |
| - | 3 | < 0.1% |
| @ | 3 | < 0.1% |
| , | 2 | < 0.1% |
| Other values (3) | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 239261 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 46453 | |
| 2 | 21723 | |
| N | 21676 | |
| (space) | 18128 | 7.6% |
| > | 15930 | 6.7% |
| o | 15341 | 6.4% |
| U | 15265 | 6.4% |
| k | 15265 | 6.4% |
| w | 15265 | 6.4% |
| r | 12471 | 5.2% |
| Other values (42) | 41744 |
Perovskite_deposition_solvents
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 343 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
| DMF; DMSO | 13990 |
|---|---|
| DMF | 8699 |
| DMF >> IPA | 5574 |
| DMSO; GBL | 3019 |
| DMF; DMSO >> IPA | 1903 |
| Other values (338) | 9312 |
Length
| Max length | 213 |
|---|---|
| Median length | 206 |
| Mean length | 8.5521802 |
| Min length | 3 |
Characters and Unicode
| Total characters | 363442 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 66 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | DMSO |
|---|---|
| 2nd row | DMSO |
| 3rd row | DMSO |
| 4th row | DMSO |
| 5th row | DMSO |
Common Values
| Value | Count | Frequency (%) |
| DMF; DMSO | 13990 | 32.9% |
| DMF | 8699 | 20.5% |
| DMF >> IPA | 5574 | 13.1% |
| DMSO; GBL | 3019 | 7.1% |
| DMF; DMSO >> IPA | 1903 | 4.5% |
| DMSO | 1747 | 4.1% |
| Unknown | 1075 | 2.5% |
| GBL | 974 | 2.3% |
| DMF >> none | 621 | 1.5% |
| none | 502 | 1.2% |
| Other values (333) | 4393 | 10.3% |
Words
| Value | Count | Frequency (%) |
| dmf | 33577 | |
| dmso | 21850 | |
| (empty) | 11767 | 13.4% |
| ipa | 9078 | 10.3% |
| gbl | 4537 | 5.2% |
| none | 2507 | 2.9% |
| unknown | 1186 | 1.4% |
| nmp | 415 | 0.5% |
| methanol | 397 | 0.5% |
| ethanol | 362 | 0.4% |
| Other values (120) | 2166 | 2.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 56567 | |
| D | 55576 | |
| (space) | 45345 | |
| F | 33614 | |
| > | 23276 | 6.4% |
| O | 22185 | 6.1% |
| S | 21850 | 6.0% |
| ; | 21481 | 5.9% |
| n | 10538 | 2.9% |
| P | 9621 | 2.6% |
| Other values (48) | 63389 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 234213 | |
| Space Separator | 45345 | 12.5% |
| Lowercase Letter | 38270 | 10.5% |
| Math Symbol | 23381 | 6.4% |
| Other Punctuation | 21516 | 5.9% |
| Decimal Number | 357 | 0.1% |
| Dash Punctuation | 310 | 0.1% |
| Open Punctuation | 25 | < 0.1% |
| Close Punctuation | 25 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 10538 | |
| e | 5734 | |
| o | 5622 | |
| t | 2700 | 7.1% |
| a | 2301 | 6.0% |
| l | 2094 | 5.5% |
| h | 1658 | 4.3% |
| k | 1186 | 3.1% |
| w | 1186 | 3.1% |
| i | 1075 | 2.8% |
| Other values (13) | 4176 | 10.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 56567 | |
| D | 55576 | |
| F | 33614 | |
| O | 22185 | 9.5% |
| S | 21850 | 9.3% |
| P | 9621 | 4.1% |
| A | 9160 | 3.9% |
| I | 9106 | 3.9% |
| B | 4614 | 2.0% |
| G | 4550 | 1.9% |
| Other values (9) | 7370 | 3.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 287 | |
| 1 | 27 | 7.6% |
| 7 | 19 | 5.3% |
| 3 | 12 | 3.4% |
| 4 | 7 | 2.0% |
| 9 | 5 | 1.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 21481 | |
| @ | 24 | 0.1% |
| , | 11 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 23276 | |
| | | 105 | 0.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 309 | |
| ‐ | 1 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 45345 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 25 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 25 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 272483 | |
| Common | 90959 | 25.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 56567 | |
| D | 55576 | |
| F | 33614 | |
| O | 22185 | 8.1% |
| S | 21850 | 8.0% |
| n | 10538 | 3.9% |
| P | 9621 | 3.5% |
| A | 9160 | 3.4% |
| I | 9106 | 3.3% |
| e | 5734 | 2.1% |
| Other values (32) | 38532 |
Common
| Value | Count | Frequency (%) |
| (space) | 45345 | |
| > | 23276 | |
| ; | 21481 | |
| - | 309 | 0.3% |
| 2 | 287 | 0.3% |
| | | 105 | 0.1% |
| 1 | 27 | < 0.1% |
| ( | 25 | < 0.1% |
| ) | 25 | < 0.1% |
| @ | 24 | < 0.1% |
| Other values (6) | 55 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 363441 | |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 56567 | |
| D | 55576 | |
| (space) | 45345 | |
| F | 33614 | |
| > | 23276 | 6.4% |
| O | 22185 | 6.1% |
| S | 21850 | 6.0% |
| ; | 21481 | 5.9% |
| n | 10538 | 2.9% |
| P | 9621 | 2.6% |
| Other values (47) | 63388 |
Punctuation
| Value | Count | Frequency (%) |
| ‐ | 1 |
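The Dash Punctuation and block tables above show a single dash character outside the ASCII block alongside 309 ordinary ASCII hyphens. It looks like U+2010 HYPHEN, but that identification is an assumption based on the rendered glyph. Such look-alike characters can split otherwise identical categories. A small sketch, using only the standard `unicodedata` module, for locating and normalizing them (both function names are hypothetical):

```python
import unicodedata


def non_ascii_chars(text):
    """Return the sorted set of characters outside the ASCII range."""
    return sorted({ch for ch in text if ord(ch) > 127})


def normalize_dashes(text):
    """Replace every Unicode dash (general category 'Pd') with ASCII '-'."""
    return "".join(
        "-" if unicodedata.category(ch) == "Pd" else ch for ch in text
    )


# Hypothetical example string containing U+2010 instead of an ASCII hyphen.
sample = "2\u2010Butanol"
cleaned = normalize_dashes(sample)
```

Whether normalizing is appropriate depends on whether the variant dash carries meaning in the source data, which this report cannot tell us.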
Perovskite_deposition_solvents_mixing_ratios
Categorical
HIGH CARDINALITY  IMBALANCE  MISSING 
| Distinct | 528 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 2125 |
| Missing (%) | 5.0% |
| Memory size | 332.1 KiB |
Length
| Max length | 31 |
|---|---|
| Median length | 26 |
| Mean length | 4.0800307 |
| Min length | 1 |
Characters and Unicode
| Total characters | 164719 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 172 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 12152 | |
| 1 >> 1 | 7617 | |
| 4; 1 | 7593 | |
| 3; 7 | 2440 | 5.7% |
| 9; 1 | 2153 | 5.1% |
| 7; 3 | 1040 | 2.4% |
| 9; 1 >> 1 | 584 | 1.4% |
| 1; 1 | 481 | 1.1% |
| 4; 1 >> 1 | 427 | 1.0% |
| 1 >> 1 >> 1 | 266 | 0.6% |
| Other values (518) | 5619 | |
| (Missing) | 2125 | 5.0% |
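The ratio strings above seem to mirror the solvent encoding, with `>>` between steps and `;` between components. Treating the numbers as relative proportions within a step is an assumption, as is the helper name `parse_ratios`. A hedged sketch that converts each step to fractions and returns `None` for steps containing non-numeric or `nan` tokens:

```python
import math


def parse_ratios(value):
    """Return one list of fractions per step, or None where a step is unusable.

    Assumes '>>' separates steps and ';' separates components within a step.
    """
    steps = []
    for step in value.split(">>"):
        try:
            numbers = [float(part) for part in step.split(";")]
        except ValueError:
            steps.append(None)  # non-numeric token, e.g. an empty string
            continue
        total = sum(numbers)
        if not all(math.isfinite(n) for n in numbers) or total <= 0:
            steps.append(None)  # 'nan', 'inf' or a non-positive total
        else:
            steps.append([n / total for n in numbers])
    return steps


# Values copied from the Common Values table above.
fractions = {v: parse_ratios(v) for v in ["1", "4; 1", "9; 1 >> 1"]}
```

Note that `float("nan")` parses without error in Python, which is why the sketch checks for finiteness explicitly instead of relying on the exception alone.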
Words
| Value | Count | Frequency (%) |
| 1 | 46289 | |
| (blank) | 11491 | 13.7% |
| 4 | 8749 | 10.5% |
| 3 | 4418 | 5.3% |
| 7 | 3971 | 4.7% |
| 9 | 3158 | 3.8% |
| 2 | 680 | 0.8% |
| nan | 499 | 0.6% |
| 6 | 433 | 0.5% |
| 8 | 428 | 0.5% |
| Other values (224) | 3578 | 4.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 47859 | |
| (space) | 43322 | |
| > | 22780 | |
| ; | 20340 | |
| 4 | 8949 | 5.4% |
| 3 | 4837 | 2.9% |
| 7 | 4613 | 2.8% |
| 9 | 3799 | 2.3% |
| 0 | 1622 | 1.0% |
| 5 | 1289 | 0.8% |
| Other values (7) | 5309 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 75836 | |
| Space Separator | 43322 | |
| Math Symbol | 22881 | 13.9% |
| Other Punctuation | 21183 | 12.9% |
| Lowercase Letter | 1497 | 0.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 47859 | |
| 4 | 8949 | 11.8% |
| 3 | 4837 | 6.4% |
| 7 | 4613 | 6.1% |
| 9 | 3799 | 5.0% |
| 0 | 1622 | 2.1% |
| 5 | 1289 | 1.7% |
| 2 | 1220 | 1.6% |
| 8 | 873 | 1.2% |
| 6 | 775 | 1.0% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 22780 | |
| | | 101 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 20340 | |
| . | 843 | 4.0% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 998 | |
| a | 499 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 43322 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 163222 | |
| Latin | 1497 | 0.9% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 47859 | |
| (space) | 43322 | |
| > | 22780 | |
| ; | 20340 | |
| 4 | 8949 | 5.5% |
| 3 | 4837 | 3.0% |
| 7 | 4613 | 2.8% |
| 9 | 3799 | 2.3% |
| 0 | 1622 | 1.0% |
| 5 | 1289 | 0.8% |
| Other values (5) | 3812 | 2.3% |
Latin
| Value | Count | Frequency (%) |
| n | 998 | |
| a | 499 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 164719 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 47859 | |
| (space) | 43322 | |
| > | 22780 | |
| ; | 20340 | |
| 4 | 8949 | 5.4% |
| 3 | 4837 | 2.9% |
| 7 | 4613 | 2.8% |
| 9 | 3799 | 2.3% |
| 0 | 1622 | 1.0% |
| 5 | 1289 | 0.8% |
| Other values (7) | 5309 | 3.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.6 KiB |
| Value | Count | Frequency (%) |
| False | 22278 | |
| True | 20219 |
Perovskite_deposition_quenching_media
Categorical
HIGH CARDINALITY  HIGH CORRELATION  IMBALANCE 
| Distinct | 94 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
Length
| Max length | 37 |
|---|---|
| Median length | 7 |
| Mean length | 9.0337906 |
| Min length | 2 |
Characters and Unicode
| Total characters | 383909 |
|---|---|
| Distinct characters | 48 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unknown |
|---|---|
| 2nd row | Unknown |
| 3rd row | Unknown |
| 4th row | Unknown |
| 5th row | Unknown |
Common Values
| Value | Count | Frequency (%) |
| Unknown | 22375 | |
| Chlorobenzene | 10363 | |
| Toluene | 3814 | 9.0% |
| Diethyl ether | 2788 | 6.6% |
| Ethyl acetate | 769 | 1.8% |
| N2 | 347 | 0.8% |
| Vacuum | 279 | 0.7% |
| Anisole | 248 | 0.6% |
| 2-Butanol | 150 | 0.4% |
| Ar | 138 | 0.3% |
| Other values (84) | 1226 | 2.9% |
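The report counts 0 missing cells for this variable, yet the most common value is the placeholder `Unknown`. If such placeholders are treated as missing (an analyst's choice, not something the report does), the effective missing rate is much higher. A minimal sketch using the counts from the table above (`is_placeholder` and `effective_missing_rate` are hypothetical names):

```python
def is_placeholder(value, sentinels=("unknown",)):
    """True if every '>>'-separated step of the value is a sentinel string."""
    parts = [part.strip().lower() for part in value.split(">>")]
    return all(part in sentinels for part in parts)


def effective_missing_rate(counts, n_missing=0):
    """Share of rows that are either truly missing or hold only placeholders."""
    total = sum(counts.values()) + n_missing
    hidden = sum(c for value, c in counts.items() if is_placeholder(value))
    return (hidden + n_missing) / total


# Counts copied from the Common Values table above.
counts = {
    "Unknown": 22375, "Chlorobenzene": 10363, "Toluene": 3814,
    "Diethyl ether": 2788, "Ethyl acetate": 769, "N2": 347,
    "Vacuum": 279, "Anisole": 248, "2-Butanol": 150, "Ar": 138,
    "Other values (84)": 1226,
}
rate = effective_missing_rate(counts)  # about 0.527
```

The "Other values" bucket is counted as informative here because its contents are not listed, so the result is a lower bound on the placeholder share.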
Words
| Value | Count | Frequency (%) |
| unknown | 22375 | |
| chlorobenzene | 10487 | |
| toluene | 3860 | 8.3% |
| ether | 3061 | 6.6% |
| diethyl | 2806 | 6.0% |
| ethyl | 947 | 2.0% |
| acetate | 843 | 1.8% |
| n2 | 413 | 0.9% |
| anisole | 299 | 0.6% |
| vacuum | 279 | 0.6% |
| Other values (69) | 1324 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 92871 | |
| e | 50707 | |
| o | 48675 | |
| U | 22375 | 5.8% |
| k | 22375 | 5.8% |
| w | 22375 | 5.8% |
| l | 19191 | 5.0% |
| h | 17593 | 4.6% |
| r | 14378 | 3.7% |
| C | 10571 | 2.8% |
| Other values (38) | 62798 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 335655 | |
| Uppercase Letter | 43018 | 11.2% |
| Space Separator | 4197 | 1.1% |
| Decimal Number | 583 | 0.2% |
| Other Punctuation | 198 | 0.1% |
| Dash Punctuation | 184 | < 0.1% |
| Math Symbol | 74 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 92871 | |
| e | 50707 | |
| o | 48675 | |
| k | 22375 | 6.7% |
| w | 22375 | 6.7% |
| l | 19191 | 5.7% |
| h | 17593 | 5.2% |
| r | 14378 | 4.3% |
| b | 10524 | 3.1% |
| z | 10507 | 3.1% |
| Other values (15) | 26459 | 7.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 22375 | |
| C | 10571 | |
| T | 3994 | 9.3% |
| D | 2899 | 6.7% |
| E | 1042 | 2.4% |
| A | 704 | 1.6% |
| N | 423 | 1.0% |
| V | 279 | 0.6% |
| B | 170 | 0.4% |
| I | 143 | 0.3% |
| Other values (8) | 418 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4197 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 583 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 198 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 184 |
Math Symbol
| Value | Count | Frequency (%) |
| > | 74 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 378673 | |
| Common | 5236 | 1.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 92871 | |
| e | 50707 | |
| o | 48675 | |
| U | 22375 | 5.9% |
| k | 22375 | 5.9% |
| w | 22375 | 5.9% |
| l | 19191 | 5.1% |
| h | 17593 | 4.6% |
| r | 14378 | 3.8% |
| C | 10571 | 2.8% |
| Other values (33) | 57562 |
Common
| Value | Count | Frequency (%) |
| (space) | 4197 | |
| 2 | 583 | 11.1% |
| ; | 198 | 3.8% |
| - | 184 | 3.5% |
| > | 74 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 383909 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 92871 | |
| e | 50707 | |
| o | 48675 | |
| U | 22375 | 5.8% |
| k | 22375 | 5.8% |
| w | 22375 | 5.8% |
| l | 19191 | 5.0% |
| h | 17593 | 4.6% |
| r | 14378 | 3.7% |
| C | 10571 | 2.8% |
| Other values (38) | 62798 |
Perovskite_deposition_quenching_media_mixing_ratios
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 41781 |
|---|---|
| Missing (%) | 98.3% |
| Memory size | 332.1 KiB |
Perovskite_deposition_quenching_media_additives_compounds
Categorical
HIGH CARDINALITY  HIGH CORRELATION  MISSING 
| Distinct | 84 |
|---|---|
| Distinct (%) | 9.6% |
| Missing | 41618 |
| Missing (%) | 97.9% |
| Memory size | 332.1 KiB |
Length
| Max length | 49 |
|---|---|
| Median length | 7 |
| Mean length | 6.9283276 |
| Min length | 2 |
Characters and Unicode
| Total characters | 6090 |
|---|---|
| Distinct characters | 62 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 25 ? |
|---|---|
| Unique (%) | 2.8% |
Sample
| 1st row | BAI |
|---|---|
| 2nd row | BAI |
| 3rd row | BAI |
| 4th row | BAI |
| 5th row | C60; PEG |
Common Values
| Value | Count | Frequency (%) |
| Undoped | 527 | 1.2% |
| PCBM-60 | 40 | 0.1% |
| MAI | 21 | < 0.1% |
| IEICO-4F | 14 | < 0.1% |
| CsPbBr3-QDs | 13 | < 0.1% |
| PTAA | 12 | < 0.1% |
| Spiro-MeOTAD | 10 | < 0.1% |
| BHT | 10 | < 0.1% |
| Polyurethane | 9 | < 0.1% |
| SWCNTs | 8 | < 0.1% |
| Other values (74) | 215 | 0.5% |
| (Missing) | 41618 |
Words
| Value | Count | Frequency (%) |
| undoped | 527 | |
| pcbm-60 | 46 | 5.1% |
| mai | 22 | 2.4% |
| ieico-4f | 14 | 1.5% |
| cspbbr3-qds | 13 | 1.4% |
| ptaa | 12 | 1.3% |
| spiro-meotad | 10 | 1.1% |
| bht | 10 | 1.1% |
| itic | 9 | 1.0% |
| polyurethane | 9 | 1.0% |
| Other values (73) | 234 |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 1067 | |
| e | 595 | 9.8% |
| n | 579 | 9.5% |
| p | 560 | 9.2% |
| o | 557 | 9.1% |
| U | 527 | 8.7% |
| P | 195 | 3.2% |
| - | 155 | 2.5% |
| C | 148 | 2.4% |
| A | 145 | 2.4% |
| Other values (52) | 1562 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3805 | |
| Uppercase Letter | 1846 | |
| Decimal Number | 211 | 3.5% |
| Dash Punctuation | 159 | 2.6% |
| Space Separator | 27 | 0.4% |
| Other Punctuation | 26 | 0.4% |
| Close Punctuation | 8 | 0.1% |
| Open Punctuation | 8 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 527 | |
| P | 195 | 10.6% |
| C | 148 | 8.0% |
| A | 145 | 7.9% |
| M | 145 | 7.9% |
| B | 139 | 7.5% |
| I | 112 | 6.1% |
| T | 76 | 4.1% |
| D | 74 | 4.0% |
| S | 55 | 3.0% |
| Other values (12) | 230 |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 1067 | |
| e | 595 | |
| n | 579 | |
| p | 560 | |
| o | 557 | |
| r | 74 | 1.9% |
| s | 73 | 1.9% |
| b | 56 | 1.5% |
| l | 39 | 1.0% |
| a | 39 | 1.0% |
| Other values (11) | 166 | 4.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 61 | |
| 6 | 59 | |
| 3 | 39 | |
| 4 | 25 | |
| 2 | 12 | 5.7% |
| 1 | 8 | 3.8% |
| 7 | 4 | 1.9% |
| 9 | 3 | 1.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 14 | |
| @ | 5 | 19.2% |
| , | 5 | 19.2% |
| : | 2 | 7.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 155 | |
| ‐ | 4 | 2.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4 | |
| ] | 4 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4 | |
| [ | 4 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 27 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5651 | |
| Common | 439 | 7.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| d | 1067 | |
| e | 595 | |
| n | 579 | |
| p | 560 | |
| o | 557 | |
| U | 527 | |
| P | 195 | 3.5% |
| C | 148 | 2.6% |
| A | 145 | 2.6% |
| M | 145 | 2.6% |
| Other values (33) | 1133 |
Common
| Value | Count | Frequency (%) |
| - | 155 | |
| 0 | 61 | 13.9% |
| 6 | 59 | 13.4% |
| 3 | 39 | 8.9% |
| (space) | 27 | 6.2% |
| 4 | 25 | 5.7% |
| ; | 14 | 3.2% |
| 2 | 12 | 2.7% |
| 1 | 8 | 1.8% |
| @ | 5 | 1.1% |
| Other values (9) | 34 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6086 | |
| Punctuation | 4 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| d | 1067 | |
| e | 595 | 9.8% |
| n | 579 | 9.5% |
| p | 560 | 9.2% |
| o | 557 | 9.2% |
| U | 527 | 8.7% |
| P | 195 | 3.2% |
| - | 155 | 2.5% |
| C | 148 | 2.4% |
| A | 145 | 2.4% |
| Other values (51) | 1558 |
Punctuation
| Value | Count | Frequency (%) |
| ‐ | 4 |
| Distinct | 870 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
Length
| Max length | 68 |
|---|---|
| Median length | 61 |
| Mean length | 7.4162411 |
| Min length | 2 |
Characters and Unicode
| Total characters | 315168 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 147 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 100 |
|---|---|
| 2nd row | 100 |
| 3rd row | 100 |
| 4th row | 100 |
| 5th row | 100 |
Common Values
| Value | Count | Frequency (%) |
| 100.0 | 14194 | |
| Unknown | 3442 | 8.1% |
| 100 | 2803 | 6.6% |
| Unknown >> 100.0 | 1345 | 3.2% |
| 150.0 | 901 | 2.1% |
| 90.0 | 689 | 1.6% |
| 70.0 >> 70.0 | 683 | 1.6% |
| 70.0 >> 100.0 | 673 | 1.6% |
| 65; 100 | 605 | 1.4% |
| 120.0 | 567 | 1.3% |
| Other values (860) | 16595 |
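The table above lists `100.0` and `100` as separate categories, which probably reflects inconsistent number formatting, not two distinct settings. That interpretation is an assumption. Under it, a short sketch that rewrites numeric tokens in a canonical form and merges the counts could look like this (the helper names are hypothetical, and the `>>` and `;` separators are assumed to delimit steps and components):

```python
from collections import Counter


def canonical_token(token):
    """Render numeric tokens without a trailing '.0'; keep other tokens as-is."""
    token = token.strip()
    try:
        number = float(token)
    except ValueError:
        return token  # e.g. "Unknown"
    if number.is_integer():
        return str(int(number))
    return str(number)


def canonical_value(value):
    """Canonicalize every token while preserving the step/component layout."""
    steps = []
    for step in value.split(">>"):
        steps.append("; ".join(canonical_token(t) for t in step.split(";")))
    return " >> ".join(steps)


# Counts copied from the Common Values table above.
observed = {"100.0": 14194, "100": 2803,
            "Unknown >> 100.0": 1345, "70.0 >> 100.0": 673}
merged = Counter()
for value, count in observed.items():
    merged[canonical_value(value)] += count
```

Applied to the full column, this kind of normalization would likely reduce the 870 distinct values, although by how much cannot be read from this report.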
Words
| Value | Count | Frequency (%) |
| 100.0 | 18459 | |
| (blank) | 10957 | |
| unknown | 8536 | |
| 100 | 6300 | 9.1% |
| 70.0 | 4118 | 6.0% |
| 150.0 | 1852 | 2.7% |
| 90.0 | 1462 | 2.1% |
| 70 | 1178 | 1.7% |
| 80.0 | 990 | 1.4% |
| 60 | 889 | 1.3% |
| Other values (139) | 14202 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 104886 | |
| 1 | 34767 | 11.0% |
| . | 34574 | 11.0% |
| (space) | 26446 | 8.4% |
| n | 25608 | 8.1% |
| > | 21450 | 6.8% |
| 5 | 9189 | 2.9% |
| w | 8536 | 2.7% |
| o | 8536 | 2.7% |
| k | 8536 | 2.7% |
| Other values (10) | 32640 | 10.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 168182 | |
| Lowercase Letter | 51216 | 16.3% |
| Other Punctuation | 39106 | 12.4% |
| Space Separator | 26446 | 8.4% |
| Math Symbol | 21682 | 6.9% |
| Uppercase Letter | 8536 | 2.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 104886 | |
| 1 | 34767 | 20.7% |
| 5 | 9189 | 5.5% |
| 7 | 5921 | 3.5% |
| 2 | 3365 | 2.0% |
| 6 | 2834 | 1.7% |
| 9 | 2556 | 1.5% |
| 8 | 2199 | 1.3% |
| 3 | 1408 | 0.8% |
| 4 | 1057 | 0.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 25608 | |
| w | 8536 | 16.7% |
| o | 8536 | 16.7% |
| k | 8536 | 16.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 34574 | |
| ; | 4532 | 11.6% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 21450 | |
| | | 232 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 26446 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 8536 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 255416 | |
| Latin | 59752 | 19.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 104886 | |
| 1 | 34767 | 13.6% |
| . | 34574 | 13.5% |
| (space) | 26446 | 10.4% |
| > | 21450 | 8.4% |
| 5 | 9189 | 3.6% |
| 7 | 5921 | 2.3% |
| ; | 4532 | 1.8% |
| 2 | 3365 | 1.3% |
| 6 | 2834 | 1.1% |
| Other values (5) | 7452 | 2.9% |
Latin
| Value | Count | Frequency (%) |
| n | 25608 | |
| w | 8536 | 14.3% |
| o | 8536 | 14.3% |
| k | 8536 | 14.3% |
| U | 8536 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 315168 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 104886 | |
| 1 | 34767 | 11.0% |
| . | 34574 | 11.0% |
| (space) | 26446 | 8.4% |
| n | 25608 | 8.1% |
| > | 21450 | 6.8% |
| 5 | 9189 | 2.9% |
| w | 8536 | 2.7% |
| o | 8536 | 2.7% |
| k | 8536 | 2.7% |
| Other values (10) | 32640 | 10.4% |
Perovskite_deposition_thermal_annealing_time
Categorical
| Distinct | 766 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
Length
| Max length | 67 |
|---|---|
| Median length | 59 |
| Mean length | 7.1448573 |
| Min length | 3 |
Characters and Unicode
| Total characters | 303635 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 120 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 10.0 |
|---|---|
| 2nd row | 10.0 |
| 3rd row | 10.0 |
| 4th row | 10.0 |
| 5th row | 10.0 |
Common Values
| Value | Count | Frequency (%) |
| 10.0 | 7739 | |
| Unknown | 4446 | 10.5% |
| 60.0 | 3851 | 9.1% |
| 30.0 | 2444 | 5.8% |
| 20.0 | 1588 | 3.7% |
| 15.0 | 1571 | 3.7% |
| 5.0 | 1484 | 3.5% |
| 45.0 | 1213 | 2.9% |
| Unknown >> 30.0 | 868 | 2.0% |
| Unknown >> 10.0 | 862 | 2.0% |
| Other values (756) | 16431 |
Words
| Value | Count | Frequency (%) |
| 10.0 | 12898 | |
| (blank) | 10655 | |
| unknown | 9800 | |
| 30.0 | 6711 | |
| 60.0 | 5199 | |
| 5.0 | 4155 | 6.1% |
| 15.0 | 3447 | 5.0% |
| 20.0 | 3039 | 4.4% |
| 2.0 | 1794 | 2.6% |
| 1.0 | 1714 | 2.5% |
| Other values (104) | 8889 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 80368 | |
| . | 47846 | |
| n | 29400 | 9.7% |
| (space) | 25804 | 8.5% |
| > | 20864 | 6.9% |
| 1 | 19939 | 6.6% |
| 5 | 10646 | 3.5% |
| U | 9800 | 3.2% |
| k | 9800 | 3.2% |
| o | 9800 | 3.2% |
| Other values (10) | 39368 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 135804 | |
| Lowercase Letter | 58800 | |
| Other Punctuation | 52340 | 17.2% |
| Space Separator | 25804 | 8.5% |
| Math Symbol | 21087 | 6.9% |
| Uppercase Letter | 9800 | 3.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 80368 | |
| 1 | 19939 | 14.7% |
| 5 | 10646 | 7.8% |
| 3 | 7862 | 5.8% |
| 2 | 6655 | 4.9% |
| 6 | 5593 | 4.1% |
| 4 | 2802 | 2.1% |
| 9 | 1052 | 0.8% |
| 8 | 495 | 0.4% |
| 7 | 392 | 0.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 29400 | |
| k | 9800 | 16.7% |
| o | 9800 | 16.7% |
| w | 9800 | 16.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 47846 | |
| ; | 4494 | 8.6% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 20864 | |
| | | 223 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 25804 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 9800 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 235035 | |
| Latin | 68600 | 22.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 80368 | |
| . | 47846 | |
| (space) | 25804 | 11.0% |
| > | 20864 | 8.9% |
| 1 | 19939 | 8.5% |
| 5 | 10646 | 4.5% |
| 3 | 7862 | 3.3% |
| 2 | 6655 | 2.8% |
| 6 | 5593 | 2.4% |
| ; | 4494 | 1.9% |
| Other values (5) | 4964 | 2.1% |
Latin
| Value | Count | Frequency (%) |
| n | 29400 | |
| U | 9800 | 14.3% |
| k | 9800 | 14.3% |
| o | 9800 | 14.3% |
| w | 9800 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 303635 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 80368 | |
| . | 47846 | |
| n | 29400 | 9.7% |
| (space) | 25804 | 8.5% |
| > | 20864 | 6.9% |
| 1 | 19939 | 6.6% |
| 5 | 10646 | 3.5% |
| U | 9800 | 3.2% |
| k | 9800 | 3.2% |
| o | 9800 | 3.2% |
| Other values (10) | 39368 |
Perovskite_deposition_thermal_annealing_atmosphere
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
Length
| Max length | 56 |
|---|---|
| Median length | 7 |
| Mean length | 6.9419488 |
| Min length | 2 |
Characters and Unicode
| Total characters | 295012 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unknown |
|---|---|
| 2nd row | Unknown |
| 3rd row | Unknown |
| 4th row | Unknown |
| 5th row | Unknown |
Common Values
| Value | Count | Frequency (%) |
| Unknown | 41150 | |
| N2 | 733 | 1.7% |
| Air | 187 | 0.4% |
| Unknown >> Unknown | 82 | 0.2% |
| Ambient | 65 | 0.2% |
| N2 >> N2 | 44 | 0.1% |
| N2; Ambient | 40 | 0.1% |
| Ar | 38 | 0.1% |
| Air >> Air | 28 | 0.1% |
| Unknown >> Air | 18 | < 0.1% |
| Other values (21) | 112 | 0.3% |
Words
| Value | Count | Frequency (%) |
| unknown | 41351 | |
| n2 | 983 | 2.3% |
| air | 341 | 0.8% |
| (blank) | 318 | 0.7% |
| ambient | 105 | 0.2% |
| ar | 54 | 0.1% |
| vacuum | 40 | 0.1% |
| dry | 23 | 0.1% |
| 70 | 3 | < 0.1% |
| o2 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 124158 | |
| U | 41351 | 14.0% |
| k | 41351 | 14.0% |
| o | 41351 | 14.0% |
| w | 41351 | 14.0% |
| 2 | 984 | 0.3% |
| N | 983 | 0.3% |
| (space) | 722 | 0.2% |
| > | 606 | 0.2% |
| A | 477 | 0.2% |
| Other values (17) | 1678 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 249741 | |
| Uppercase Letter | 42875 | 14.5% |
| Decimal Number | 990 | 0.3% |
| Space Separator | 722 | 0.2% |
| Math Symbol | 621 | 0.2% |
| Other Punctuation | 63 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 124158 | |
| k | 41351 | 16.6% |
| o | 41351 | 16.6% |
| w | 41351 | 16.6% |
| i | 446 | 0.2% |
| r | 418 | 0.2% |
| m | 145 | 0.1% |
| t | 105 | < 0.1% |
| b | 105 | < 0.1% |
| e | 105 | < 0.1% |
| Other values (4) | 206 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 41351 | |
| N | 983 | 2.3% |
| A | 477 | 1.1% |
| V | 40 | 0.1% |
| D | 23 | 0.1% |
| O | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 984 | |
| 7 | 3 | 0.3% |
| 0 | 3 | 0.3% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 606 | |
| | | 15 | 2.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 722 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 63 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 292616 | |
| Common | 2396 | 0.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 124158 | |
| U | 41351 | 14.1% |
| k | 41351 | 14.1% |
| o | 41351 | 14.1% |
| w | 41351 | 14.1% |
| N | 983 | 0.3% |
| A | 477 | 0.2% |
| i | 446 | 0.2% |
| r | 418 | 0.1% |
| m | 145 | < 0.1% |
| Other values (10) | 585 | 0.2% |
Common
| Value | Count | Frequency (%) |
| 2 | 984 | |
| (space) | 722 | |
| > | 606 | |
| ; | 63 | 2.6% |
| | | 15 | 0.6% |
| 7 | 3 | 0.1% |
| 0 | 3 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 295012 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 124158 | |
| U | 41351 | 14.0% |
| k | 41351 | 14.0% |
| o | 41351 | 14.0% |
| w | 41351 | 14.0% |
| 2 | 984 | 0.3% |
| N | 983 | 0.3% |
| (space) | 722 | 0.2% |
| > | 606 | 0.2% |
| A | 477 | 0.2% |
| Other values (17) | 1678 | 0.6% |
Perovskite_deposition_solvent_annealing
Boolean
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.6 KiB |
| Value | Count | Frequency (%) |
| False | 41607 | |
| True | 890 | 2.1% |
Perovskite_deposition_solvent_annealing_solvent_atmosphere
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 38 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
Length
| Max length | 26 |
|---|---|
| Median length | 7 |
| Mean length | 6.9736452 |
| Min length | 3 |
Characters and Unicode
| Total characters | 296359 |
|---|---|
| Distinct characters | 44 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unknown |
|---|---|
| 2nd row | Unknown |
| 3rd row | Unknown |
| 4th row | Unknown |
| 5th row | Unknown |
Common Values
| Value | Count | Frequency (%) |
| Unknown | 41797 | |
| DMF | 222 | 0.5% |
| DMSO | 214 | 0.5% |
| DMF; DMSO | 29 | 0.1% |
| Vacuum | 27 | 0.1% |
| Chlorobenzene; DMF | 24 | 0.1% |
| HCl | 21 | < 0.1% |
| H2O | 19 | < 0.1% |
| IPA | 17 | < 0.1% |
| MACl | 15 | < 0.1% |
| Other values (28) | 112 | 0.3% |
Words
| Value | Count | Frequency (%) |
| unknown | 41797 | |
| dmf | 282 | 0.7% |
| dmso | 263 | 0.6% |
| chlorobenzene | 42 | 0.1% |
| vacuum | 27 | 0.1% |
| h2o | 24 | 0.1% |
| ipa | 24 | 0.1% |
| hcl | 21 | < 0.1% |
| macl | 15 | < 0.1% |
| air | 10 | < 0.1% |
| Other values (20) | 89 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 125545 | |
| o | 41928 | 14.1% |
| U | 41797 | 14.1% |
| k | 41797 | 14.1% |
| w | 41797 | 14.1% |
| M | 583 | 0.2% |
| D | 552 | 0.2% |
| O | 287 | 0.1% |
| F | 282 | 0.1% |
| S | 263 | 0.1% |
| Other values (34) | 1528 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 252049 | |
| Uppercase Letter | 44091 | 14.9% |
| Space Separator | 97 | < 0.1% |
| Other Punctuation | 88 | < 0.1% |
| Decimal Number | 30 | < 0.1% |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 125545 | |
| o | 41928 | 16.6% |
| k | 41797 | 16.6% |
| w | 41797 | 16.6% |
| e | 207 | 0.1% |
| l | 147 | 0.1% |
| h | 84 | < 0.1% |
| a | 81 | < 0.1% |
| r | 67 | < 0.1% |
| u | 65 | < 0.1% |
| Other values (11) | 331 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 41797 | |
| M | 583 | 1.3% |
| D | 552 | 1.3% |
| O | 287 | 0.7% |
| F | 282 | 0.6% |
| S | 263 | 0.6% |
| C | 82 | 0.2% |
| A | 54 | 0.1% |
| H | 50 | 0.1% |
| P | 34 | 0.1% |
| Other values (8) | 107 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 24 | |
| 4 | 6 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 97 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 88 |
Dash Punctuation
| Value | Count | Frequency (%) |
| ‐ | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 296140 | |
| Common | 219 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 125545 | |
| o | 41928 | 14.2% |
| U | 41797 | 14.1% |
| k | 41797 | 14.1% |
| w | 41797 | 14.1% |
| M | 583 | 0.2% |
| D | 552 | 0.2% |
| O | 287 | 0.1% |
| F | 282 | 0.1% |
| S | 263 | 0.1% |
| Other values (29) | 1309 | 0.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 97 | |
| ; | 88 | |
| 2 | 24 | 11.0% |
| 4 | 6 | 2.7% |
| ‐ | 4 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 296355 | |
| Punctuation | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 125545 | |
| o | 41928 | 14.1% |
| U | 41797 | 14.1% |
| k | 41797 | 14.1% |
| w | 41797 | 14.1% |
| M | 583 | 0.2% |
| D | 552 | 0.2% |
| O | 287 | 0.1% |
| F | 282 | 0.1% |
| S | 263 | 0.1% |
| Other values (33) | 1524 | 0.5% |
Punctuation
| Value | Count | Frequency (%) |
| ‐ | 4 |
Perovskite_deposition_after_treatment_of_formed_perovskite
Categorical
HIGH CARDINALITY  IMBALANCE  MISSING 
| Distinct | 103 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 40873 |
| Missing (%) | 96.2% |
| Memory size | 332.1 KiB |
Length
| Max length | 106 |
|---|---|
| Median length | 15 |
| Mean length | 18.455049 |
| Min length | 4 |
Characters and Unicode
| Total characters | 29971 |
|---|---|
| Distinct characters | 61 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 23 ? |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | Washed with IPA |
|---|---|
| 2nd row | Washed with IPA |
| 3rd row | Washed with IPA |
| 4th row | Washed with IPA |
| 5th row | Washed with IPA |
Common Values
| Value | Count | Frequency (%) |
| Washed with IPA | 1078 | 2.5% |
| Dried under flow of clean air | 51 | 0.1% |
| UV radiation | 29 | 0.1% |
| Washed with Methyl acetate | 23 | 0.1% |
| Washed with IPA >> Washed with Dichloromethane | 21 | < 0.1% |
| Ultrasonic transducer | 16 | < 0.1% |
| Light soaking | 14 | < 0.1% |
| Intense pulsed light annealing | 13 | < 0.1% |
| Washed with Ether | 13 | < 0.1% |
| Washed with Toluene | 13 | < 0.1% |
| Other values (93) | 353 | 0.8% |
| (Missing) | 40873 |
Words
| Value | Count | Frequency (%) |
| with | 1252 | |
| washed | 1240 | |
| ipa | 1132 | |
| in | 73 | 1.4% |
| of | 71 | 1.3% |
| annealing | 71 | 1.3% |
| under | 67 | 1.3% |
| flow | 66 | 1.2% |
| air | 64 | 1.2% |
| radiation | 64 | 1.2% |
| Other values (152) | 1219 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 3705 | |
| h | 2738 | 9.1% |
| e | 2326 | 7.8% |
| i | 2170 | 7.2% |
| a | 2168 | 7.2% |
| t | 1927 | 6.4% |
| d | 1642 | 5.5% |
| s | 1557 | 5.2% |
| w | 1350 | 4.5% |
| W | 1240 | 4.1% |
| Other values (51) | 9148 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20546 | |
| Uppercase Letter | 5465 | 18.2% |
| Space Separator | 3705 | 12.4% |
| Math Symbol | 120 | 0.4% |
| Decimal Number | 55 | 0.2% |
| Dash Punctuation | 43 | 0.1% |
| Other Punctuation | 23 | 0.1% |
| Close Punctuation | 7 | < 0.1% |
| Open Punctuation | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| h | 2738 | |
| e | 2326 | |
| i | 2170 | |
| a | 2168 | |
| t | 1927 | |
| d | 1642 | |
| s | 1557 | |
| w | 1350 | |
| n | 1042 | 5.1% |
| o | 708 | 3.4% |
| Other values (14) | 2918 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 1240 | |
| I | 1208 | |
| A | 1205 | |
| P | 1159 | |
| D | 151 | 2.8% |
| S | 66 | 1.2% |
| M | 61 | 1.1% |
| U | 56 | 1.0% |
| E | 47 | 0.9% |
| V | 45 | 0.8% |
| Other values (10) | 227 | 4.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 25 | |
| 1 | 11 | |
| 5 | 10 | 18.2% |
| 3 | 4 | 7.3% |
| 0 | 4 | 7.3% |
| 4 | 1 | 1.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 7 | |
| @ | 7 | |
| : | 4 | |
| ; | 2 | 8.7% |
| , | 2 | 8.7% |
| . | 1 | 4.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3705 |
Math Symbol
| Value | Count | Frequency (%) |
| > | 120 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 43 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 7 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 26011 | |
| Common | 3960 | 13.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| h | 2738 | 10.5% |
| e | 2326 | 8.9% |
| i | 2170 | 8.3% |
| a | 2168 | 8.3% |
| t | 1927 | 7.4% |
| d | 1642 | 6.3% |
| s | 1557 | 6.0% |
| w | 1350 | 5.2% |
| W | 1240 | 4.8% |
| I | 1208 | 4.6% |
| Other values (34) | 7685 |
Common
| Value | Count | Frequency (%) |
| (space) | 3705 | |
| > | 120 | 3.0% |
| - | 43 | 1.1% |
| 2 | 25 | 0.6% |
| 1 | 11 | 0.3% |
| 5 | 10 | 0.3% |
| ) | 7 | 0.2% |
| / | 7 | 0.2% |
| ( | 7 | 0.2% |
| @ | 7 | 0.2% |
| Other values (7) | 18 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29971 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 3705 | 12.4% |
| h | 2738 | 9.1% |
| e | 2326 | 7.8% |
| i | 2170 | 7.2% |
| a | 2168 | 7.2% |
| t | 1927 | 6.4% |
| d | 1642 | 5.5% |
| s | 1557 | 5.2% |
| w | 1350 | 4.5% |
| W | 1240 | 4.1% |
| Other values (51) | 9148 |
Perovskite_surface_treatment_before_next_deposition_step
Categorical
HIGH CORRELATION  MISSING  UNIFORM 
| Distinct | 2 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 42493 |
| Missing (%) | > 99.9% |
| Memory size | 332.1 KiB |
Top categories
| Value | Count |
|---|---|
| Ar plasma | 2 |
| UV | 2 |
Length
| Max length | 9 |
|---|---|
| Median length | 5.5 |
| Mean length | 5.5 |
| Min length | 2 |
Characters and Unicode
| Total characters | 22 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ar plasma |
|---|---|
| 2nd row | Ar plasma |
| 3rd row | UV |
| 4th row | UV |
Common Values
| Value | Count | Frequency (%) |
| Ar plasma | 2 | < 0.1% |
| UV | 2 | < 0.1% |
| (Missing) | 42493 | > 99.9% |
Words
| Value | Count | Frequency (%) |
| ar | 2 | |
| plasma | 2 | |
| uv | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| A | 2 | |
| r | 2 | |
| (space) | 2 | 9.1% |
| p | 2 | |
| l | 2 | |
| s | 2 | |
| m | 2 | |
| U | 2 | |
| V | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14 | |
| Uppercase Letter | 6 | |
| Space Separator | 2 | 9.1% |
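The category counts above can be checked against the four non-missing values using only the standard library. This is a minimal sketch, assuming the report classifies each character by its Unicode general category (`Ll` for lowercase letter, `Lu` for uppercase letter, `Zs` for space separator); the variable names are illustrative and not taken from the report.

```python
import unicodedata
from collections import Counter

# The four non-missing values listed under "Common Values" for this column.
values = ["Ar plasma", "Ar plasma", "UV", "UV"]

# Tally the Unicode general category of every character in every value.
category_counts = Counter(
    unicodedata.category(ch) for value in values for ch in value
)
total_characters = sum(category_counts.values())

# Print each category with its count and share of all characters.
for code, count in category_counts.most_common():
    print(f"{code}: {count} ({count / total_characters:.1%})")
```

If the assumption holds, the tallies should agree with the 14 / 6 / 2 split and the 22 total characters reported for this column.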
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| r | 2 | |
| p | 2 | |
| l | 2 | |
| s | 2 | |
| m | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| U | 2 | |
| V | 2 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20 | |
| Common | 2 | 9.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| A | 2 | |
| r | 2 | |
| p | 2 | |
| l | 2 | |
| s | 2 | |
| m | 2 | |
| U | 2 | |
| V | 2 |
Common
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| A | 2 | |
| r | 2 | |
| (space) | 2 | 9.1% |
| p | 2 | |
| l | 2 | |
| s | 2 | |
| m | 2 | |
| U | 2 | |
| V | 2 |
HTL_stack_sequence
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 1959 |
|---|---|
| Distinct (%) | 4.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
Top categories
| Value | Count |
|---|---|
| Spiro-MeOTAD | 20918 |
| PEDOT:PSS | 6591 |
| none | 2626 |
| PTAA | 1854 |
| NiO-c | 1700 |
| Other values (1954) | 8808 |
Length
| Max length | 231 |
|---|---|
| Median length | 183 |
| Mean length | 10.144904 |
| Min length | 1 |
Characters and Unicode
| Total characters | 431128 |
|---|---|
| Distinct characters | 83 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 928 ? |
|---|---|
| Unique (%) | 2.2% |
Sample
| 1st row | Spiro-MeOTAD |
|---|---|
| 2nd row | Spiro-MeOTAD |
| 3rd row | Spiro-MeOTAD |
| 4th row | Spiro-MeOTAD |
| 5th row | Spiro-MeOTAD |
Common Values
| Value | Count | Frequency (%) |
| Spiro-MeOTAD | 20918 | 49.2% |
| PEDOT:PSS | 6591 | 15.5% |
| none | 2626 | 6.2% |
| PTAA | 1854 | 4.4% |
| NiO-c | 1700 | 4.0% |
| P3HT | 885 | 2.1% |
| NiO-np | 411 | 1.0% |
| CuSCN | 220 | 0.5% |
| NiMgLiO | 160 | 0.4% |
| NiO | 160 | 0.4% |
| Other values (1949) | 6972 | 16.4% |
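The Frequency (%) column appears to be each count divided by the 42497 observations in the dataset; for example, 6591 of 42497 gives 15.5%. The helper below is a sketch of that arithmetic, including the `< 0.1%` and `> 99.9%` edge labels seen elsewhere in the report. The function name and the exact rounding rule are assumptions inferred from the printed values, not taken from the profiling tool's source.

```python
N_OBSERVATIONS = 42497  # "Number of observations" from the dataset statistics


def frequency_label(count, n=N_OBSERVATIONS):
    """Format count / n as a percentage string with one decimal place."""
    share = count / n
    # Tiny but non-zero shares are shown as "< 0.1%" rather than "0.0%".
    if share > 0 and round(share, 3) == 0:
        return "< 0.1%"
    # Shares just below 100% are shown as "> 99.9%" rather than "100.0%".
    if share < 1 and round(share, 3) == 1:
        return "> 99.9%"
    return f"{share * 100:.1f}%"


for value, count in [("PEDOT:PSS", 6591), ("none", 2626), ("PTAA", 1854)]:
    print(value, frequency_label(count))
```

The same helper can be used to fill in rows whose percentage was rendered only inside the bar graphic and so is blank in the text export.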
Words
| Value | Count | Frequency (%) |
| spiro-meotad | 21617 | |
| pedot:pss | 7214 | 15.1% |
| none | 2626 | 5.5% |
| ptaa | 2167 | 4.5% |
| 2163 | 4.5% | |
| nio-c | 1985 | 4.2% |
| p3ht | 988 | 2.1% |
| nio-np | 476 | 1.0% |
| cuscn | 267 | 0.6% |
| moo3 | 214 | 0.4% |
| Other values (1730) | 8010 | 16.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 37397 | 8.7% |
| T | 35663 | 8.3% |
| O | 33382 | 7.7% |
| D | 30380 | 7.0% |
| - | 28268 | 6.6% |
| A | 27443 | 6.4% |
| e | 26831 | 6.2% |
| i | 26520 | 6.2% |
| o | 26294 | 6.1% |
| p | 23343 | 5.4% |
| Other values (73) | 135607 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 230202 | |
| Lowercase Letter | 147843 | |
| Dash Punctuation | 28358 | 6.6% |
| Other Punctuation | 8984 | 2.1% |
| Decimal Number | 6982 | 1.6% |
| Space Separator | 5228 | 1.2% |
| Math Symbol | 2145 | 0.5% |
| Open Punctuation | 691 | 0.2% |
| Close Punctuation | 684 | 0.2% |
| Final Punctuation | 5 | < 0.1% |
| Other values (2) | 6 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 37397 | |
| T | 35663 | |
| O | 33382 | |
| D | 30380 | |
| A | 27443 | |
| M | 23038 | |
| P | 21598 | |
| E | 7684 | 3.3% |
| N | 4458 | 1.9% |
| C | 2795 | 1.2% |
| Other values (16) | 6364 | 2.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 26831 | |
| i | 26520 | |
| o | 26294 | |
| p | 23343 | |
| r | 22779 | |
| n | 7704 | 5.2% |
| c | 2763 | 1.9% |
| l | 1468 | 1.0% |
| a | 1405 | 1.0% |
| h | 1275 | 0.9% |
| Other values (15) | 7461 | 5.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2251 | |
| 2 | 1581 | |
| 1 | 877 | 12.6% |
| 4 | 678 | 9.7% |
| 0 | 376 | 5.4% |
| 5 | 343 | 4.9% |
| 6 | 325 | 4.7% |
| 7 | 212 | 3.0% |
| 9 | 198 | 2.8% |
| 8 | 141 | 2.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 7399 | |
| , | 720 | 8.0% |
| ; | 500 | 5.6% |
| ' | 160 | 1.8% |
| ′ | 86 | 1.0% |
| @ | 67 | 0.7% |
| . | 52 | 0.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 28268 | |
| ‐ | 87 | 0.3% |
| – | 3 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 496 | |
| [ | 192 | 27.8% |
| { | 3 | 0.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 492 | |
| ] | 189 | 27.6% |
| } | 3 | 0.4% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 1 | |
| ` | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5228 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 2145 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 5 |
Control
| Value | Count | Frequency (%) |
| 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 378045 | |
| Common | 53083 | 12.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 37397 | |
| T | 35663 | |
| O | 33382 | 8.8% |
| D | 30380 | 8.0% |
| A | 27443 | 7.3% |
| e | 26831 | 7.1% |
| i | 26520 | 7.0% |
| o | 26294 | 7.0% |
| p | 23343 | 6.2% |
| M | 23038 | 6.1% |
| Other values (41) | 87754 |
Common
| Value | Count | Frequency (%) |
| - | 28268 | |
| : | 7399 | 13.9% |
| (space) | 5228 | 9.8% |
| 3 | 2251 | 4.2% |
| \| | 2145 | 4.0% |
| 2 | 1581 | 3.0% |
| 1 | 877 | 1.7% |
| , | 720 | 1.4% |
| 4 | 678 | 1.3% |
| ; | 500 | 0.9% |
| Other values (22) | 3436 | 6.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 430946 | |
| Punctuation | 181 | < 0.1% |
| None | 1 | < 0.1% |
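The 182 characters outside the ASCII block are typographic variants of the hyphen and apostrophe (hyphen U+2010, en dash U+2013, prime U+2032, right single quotation mark U+2019, acute accent U+00B4), so otherwise identical layer names can end up as distinct categories. The sketch below folds them to ASCII before grouping. It assumes the variants carry no chemical meaning, which should be checked against the source entries, and the example string is hypothetical.

```python
# Map typographic variants to their ASCII look-alikes.
PUNCTUATION_MAP = str.maketrans({
    "\u2010": "-",  # HYPHEN
    "\u2013": "-",  # EN DASH
    "\u2032": "'",  # PRIME
    "\u2019": "'",  # RIGHT SINGLE QUOTATION MARK
    "\u00b4": "'",  # ACUTE ACCENT
    "\u0060": "'",  # GRAVE ACCENT (ASCII, but used here as an apostrophe)
})


def normalize_punctuation(name):
    """Return name with hyphen and apostrophe variants folded to ASCII."""
    return name.translate(PUNCTUATION_MAP)


# Hypothetical example: the same material typed with a Unicode hyphen.
print(normalize_punctuation("Spiro\u2010MeOTAD"))
```

Applying this before counting distinct values would show how much of the high cardinality comes from punctuation alone.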
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 37397 | 8.7% |
| T | 35663 | 8.3% |
| O | 33382 | 7.7% |
| D | 30380 | 7.0% |
| - | 28268 | 6.6% |
| A | 27443 | 6.4% |
| e | 26831 | 6.2% |
| i | 26520 | 6.2% |
| o | 26294 | 6.1% |
| p | 23343 | 5.4% |
| Other values (68) | 135425 |
Punctuation
| Value | Count | Frequency (%) |
| ‐ | 87 | |
| ′ | 86 | |
| ’ | 5 | 2.8% |
| – | 3 | 1.7% |
None
| Value | Count | Frequency (%) |
| ´ | 1 |
HTL_thickness_list
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 439 |
|---|---|
| Distinct (%) | 4.4% |
| Missing | 32425 |
| Missing (%) | 76.3% |
| Memory size | 332.1 KiB |
Top categories
| Value | Count |
|---|---|
| 200.0 | 1132 |
| 40.0 | 936 |
| 150.0 | 704 |
| 30.0 | 682 |
| 100.0 | 550 |
| Other values (434) | 6068 |
Length
| Max length | 16 |
|---|---|
| Median length | 15 |
| Mean length | 5.0764496 |
| Min length | 3 |
Characters and Unicode
| Total characters | 51130 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 181 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | 50.0 |
|---|---|
| 2nd row | 50.0 |
| 3rd row | 50.0 |
| 4th row | 50.0 |
| 5th row | 70.0 \| nan |
Common Values
| Value | Count | Frequency (%) |
| 200.0 | 1132 | 2.7% |
| 40.0 | 936 | 2.2% |
| 150.0 | 704 | 1.7% |
| 30.0 | 682 | 1.6% |
| 100.0 | 550 | 1.3% |
| 50.0 | 496 | 1.2% |
| 20.0 | 487 | 1.1% |
| 250.0 | 294 | 0.7% |
| 35.0 | 227 | 0.5% |
| 60.0 | 226 | 0.5% |
| Other values (429) | 4338 | 10.2% |
| (Missing) | 32425 | 76.3% |
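The column is typed as categorical because each cell is a string such as `70.0 | nan` rather than a single number. A minimal sketch of turning one cell into a list of floats follows, assuming the pipe separates the layers of the HTL stack and that any non-numeric token should become NaN; the function name is illustrative.

```python
import math


def parse_thickness_list(cell):
    """Split an 'a | b | ...' cell into one float per layer (NaN if not numeric)."""
    layers = []
    for token in cell.split("|"):
        token = token.strip()
        try:
            layers.append(float(token))  # float("nan") also parses to NaN
        except ValueError:
            layers.append(math.nan)      # free-text entries become NaN
    return layers


print(parse_thickness_list("50.0"))
print(parse_thickness_list("70.0 | nan"))
```

Once parsed, per-layer statistics (or the total thickness of fully specified stacks) can be computed numerically instead of by string frequency.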
Words
| Value | Count | Frequency (%) |
| 200.0 | 1187 | 9.9% |
| 40.0 | 1027 | 8.6% |
| 944 | 7.9% | |
| 150.0 | 738 | 6.2% |
| 30.0 | 725 | 6.1% |
| 20.0 | 593 | 5.0% |
| 100.0 | 581 | 4.9% |
| 50.0 | 545 | 4.6% |
| 10.0 | 516 | 4.3% |
| nan | 482 | 4.0% |
| Other values (250) | 4622 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 21377 | |
| . | 10527 | |
| 1 | 3363 | 6.6% |
| 2 | 3282 | 6.4% |
| 5 | 3050 | 6.0% |
| (space) | 1888 | 3.7% |
| 3 | 1630 | 3.2% |
| 4 | 1535 | 3.0% |
| n | 976 | 1.9% |
| \| | 944 | 1.8% |
| Other values (12) | 2558 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 36285 | |
| Other Punctuation | 10527 | 20.6% |
| Space Separator | 1888 | 3.7% |
| Lowercase Letter | 1483 | 2.9% |
| Math Symbol | 944 | 1.8% |
| Uppercase Letter | 3 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 21377 | |
| 1 | 3363 | 9.3% |
| 2 | 3282 | 9.0% |
| 5 | 3050 | 8.4% |
| 3 | 1630 | 4.5% |
| 4 | 1535 | 4.2% |
| 8 | 682 | 1.9% |
| 6 | 594 | 1.6% |
| 7 | 514 | 1.4% |
| 9 | 258 | 0.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 976 | |
| a | 482 | |
| u | 7 | 0.5% |
| k | 4 | 0.3% |
| o | 4 | 0.3% |
| w | 4 | 0.3% |
| r | 3 | 0.2% |
| e | 3 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 10527 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1888 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 944 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 49644 | |
| Latin | 1486 | 2.9% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 21377 | |
| . | 10527 | |
| 1 | 3363 | 6.8% |
| 2 | 3282 | 6.6% |
| 5 | 3050 | 6.1% |
| (space) | 1888 | 3.8% |
| 3 | 1630 | 3.3% |
| 4 | 1535 | 3.1% |
| \| | 944 | 1.9% |
| 8 | 682 | 1.4% |
| Other values (3) | 1366 | 2.8% |
Latin
| Value | Count | Frequency (%) |
| n | 976 | |
| a | 482 | |
| u | 7 | 0.5% |
| k | 4 | 0.3% |
| o | 4 | 0.3% |
| w | 4 | 0.3% |
| T | 3 | 0.2% |
| r | 3 | 0.2% |
| e | 3 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51130 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 21377 | |
| . | 10527 | |
| 1 | 3363 | 6.6% |
| 2 | 3282 | 6.4% |
| 5 | 3050 | 6.0% |
| (space) | 1888 | 3.7% |
| 3 | 1630 | 3.2% |
| 4 | 1535 | 3.0% |
| n | 976 | 1.9% |
| \| | 944 | 1.8% |
| Other values (12) | 2558 | 5.0% |
HTL_additives_compounds
Categorical
HIGH CARDINALITY  IMBALANCE  MISSING 
| Distinct | 380 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 18371 |
| Missing (%) | 43.2% |
| Memory size | 332.1 KiB |
Top categories
| Value | Count |
|---|---|
| Li-TFSI; TBP | 16635 |
| FK209; Li-TFSI; TBP | 3475 |
| Undoped | 422 |
| FK102; Li-TFSI; TBP | 388 |
| F4-TCNQ | 322 |
| Other values (375) | 2884 |
Length
| Max length | 71 |
|---|---|
| Median length | 12 |
| Mean length | 13.158543 |
| Min length | 1 |
Characters and Unicode
| Total characters | 317463 |
|---|---|
| Distinct characters | 74 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 120 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Li(CF3SO2)2N; TBP |
|---|---|
| 2nd row | Li(CF3SO2)2N; TBP |
| 3rd row | Li(CF3SO2)2N; TBP |
| 4th row | Li(CF3SO2)2N; TBP |
| 5th row | Li(CF3SO2)2N; TBP |
Common Values
| Value | Count | Frequency (%) |
| Li-TFSI; TBP | 16635 | 39.1% |
| FK209; Li-TFSI; TBP | 3475 | 8.2% |
| Undoped | 422 | 1.0% |
| FK102; Li-TFSI; TBP | 388 | 0.9% |
| F4-TCNQ | 322 | 0.8% |
| Unknown \| Li-TFSI; TBP | 243 | 0.6% |
| Li-TFSI | 209 | 0.5% |
| Cu | 202 | 0.5% |
| Unknown \| FK209; Li-TFSI; TBP | 108 | 0.3% |
| Li-TFSI; TBP; FK209 | 102 | 0.2% |
| Other values (370) | 2020 | 4.8% |
| (Missing) | 18371 | 43.2% |
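Two rows in the table above, `FK209; Li-TFSI; TBP` and `Li-TFSI; TBP; FK209`, list the same three additives in a different order and are therefore counted as separate categories. The sketch below groups cells by an order-insensitive key. It assumes that `;` separates additives within a layer, that `|` separates layers, and that additive order carries no meaning. If HTL_additives_concentrations is positionally aligned with this column, the concentrations would have to be reordered in step, which this sketch does not do.

```python
from collections import Counter


def additive_key(cell):
    """Order-insensitive key: one frozenset of additive names per layer."""
    return tuple(
        frozenset(name.strip() for name in layer.split(";"))
        for layer in cell.split("|")
    )


# Two rows from the Common Values table that differ only in additive order.
counts = {"FK209; Li-TFSI; TBP": 3475, "Li-TFSI; TBP; FK209": 102}

merged = Counter()
for cell, count in counts.items():
    merged[additive_key(cell)] += count

print(len(merged), sum(merged.values()))
```

Layer order is preserved by the tuple, so `Unknown | Li-TFSI; TBP` stays distinct from a single-layer `Li-TFSI; TBP` entry.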
Words
| Value | Count | Frequency (%) |
| li-tfsi | 21799 | |
| tbp | 21653 | |
| fk209 | 3787 | 7.3% |
| 774 | 1.5% | |
| unknown | 650 | 1.2% |
| undoped | 533 | 1.0% |
| fk102 | 436 | 0.8% |
| f4-tcnq | 379 | 0.7% |
| cu | 239 | 0.5% |
| co | 66 | 0.1% |
| Other values (304) | 1801 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 44306 | |
| (space) | 27991 | 8.8% |
| F | 26859 | |
| ; | 26258 | |
| - | 22677 | 7.1% |
| i | 22414 | 7.1% |
| S | 22161 | 7.0% |
| I | 22036 | 6.9% |
| P | 22034 | 6.9% |
| L | 21971 | 6.9% |
| Other values (64) | 58756 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 190139 | |
| Lowercase Letter | 35035 | 11.0% |
| Space Separator | 27991 | 8.8% |
| Other Punctuation | 26364 | 8.3% |
| Dash Punctuation | 22677 | 7.1% |
| Decimal Number | 14009 | 4.4% |
| Math Symbol | 780 | 0.2% |
| Open Punctuation | 234 | 0.1% |
| Close Punctuation | 234 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 44306 | |
| F | 26859 | |
| S | 22161 | |
| I | 22036 | |
| P | 22034 | |
| L | 21971 | |
| B | 21874 | |
| K | 4280 | 2.3% |
| C | 1192 | 0.6% |
| U | 1191 | 0.6% |
| Other values (16) | 2235 | 1.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 22414 | |
| n | 2970 | 8.5% |
| o | 1801 | 5.1% |
| d | 1324 | 3.8% |
| e | 1081 | 3.1% |
| p | 763 | 2.2% |
| k | 668 | 1.9% |
| w | 650 | 1.9% |
| u | 486 | 1.4% |
| a | 472 | 1.3% |
| Other values (14) | 2406 | 6.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 4606 | |
| 0 | 4265 | |
| 9 | 3822 | |
| 4 | 482 | 3.4% |
| 1 | 472 | 3.4% |
| 3 | 225 | 1.6% |
| 6 | 87 | 0.6% |
| 5 | 36 | 0.3% |
| 8 | 14 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 26258 | |
| , | 47 | 0.2% |
| @ | 30 | 0.1% |
| ′ | 18 | 0.1% |
| : | 7 | < 0.1% |
| . | 3 | < 0.1% |
| * | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 774 | |
| + | 6 | 0.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 227 | |
| [ | 7 | 3.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 227 | |
| ] | 7 | 3.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 27991 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 22677 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 225174 | |
| Common | 92289 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 44306 | |
| F | 26859 | |
| i | 22414 | |
| S | 22161 | |
| I | 22036 | |
| P | 22034 | |
| L | 21971 | |
| B | 21874 | |
| K | 4280 | 1.9% |
| n | 2970 | 1.3% |
| Other values (40) | 14269 | 6.3% |
Common
| Value | Count | Frequency (%) |
| (space) | 27991 | |
| ; | 26258 | |
| - | 22677 | |
| 2 | 4606 | 5.0% |
| 0 | 4265 | 4.6% |
| 9 | 3822 | 4.1% |
| \| | 774 | 0.8% |
| 4 | 482 | 0.5% |
| 1 | 472 | 0.5% |
| ( | 227 | 0.2% |
| Other values (14) | 715 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 317445 | |
| Punctuation | 18 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 44306 | |
| (space) | 27991 | 8.8% |
| F | 26859 | |
| ; | 26258 | |
| - | 22677 | 7.1% |
| i | 22414 | 7.1% |
| S | 22161 | 7.0% |
| I | 22036 | 6.9% |
| P | 22034 | 6.9% |
| L | 21971 | 6.9% |
| Other values (63) | 58738 |
Punctuation
| Value | Count | Frequency (%) |
| ′ | 18 |
HTL_additives_concentrations
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 223 |
|---|---|
| Distinct (%) | 21.5% |
| Missing | 41462 |
| Missing (%) | 97.6% |
| Memory size | 332.1 KiB |
Top categories
| Value | Count |
|---|---|
| 9.1 mg/ml; 0.029 ml/ml | 51 |
| 14.2 mM; 8 vol% | 40 |
| 17.5 uL(520mg/mLACN); 28.8 uL | 34 |
| nan \| 2 uL/mL | 30 |
| 9.1 mg/ml; 28.8 µl/ml | 25 |
| Other values (218) | 855 |
Length
| Max length | 52 |
|---|---|
| Median length | 35 |
| Mean length | 20.761353 |
| Min length | 3 |
Characters and Unicode
| Total characters | 21488 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 73 ? |
|---|---|
| Unique (%) | 7.1% |
Sample
| 1st row | 2 wt% |
|---|---|
| 2nd row | 5 wt% |
| 3rd row | 10 wt% |
| 4th row | 0.5 vol%; nan; nan |
| 5th row | 1 vol%; nan; nan |
Common Values
| Value | Count | Frequency (%) |
| 9.1 mg/ml; 0.029 ml/ml | 51 | 0.1% |
| 14.2 mM; 8 vol% | 40 | 0.1% |
| 17.5 uL(520mg/mLACN); 28.8 uL | 34 | 0.1% |
| nan \| 2 uL/mL | 30 | 0.1% |
| 9.1 mg/ml; 28.8 µl/ml | 25 | 0.1% |
| 9.1 mg/ml; 0.0288 ml/ml | 22 | 0.1% |
| 520 mg/ml; 0.0338 vol% | 20 | < 0.1% |
| 5.2 mg/ml; 0.02 ml/ml | 17 | < 0.1% |
| 22.5 uL; 15 uL | 16 | < 0.1% |
| 520 mg/ml; 2.88 vol% | 15 | < 0.1% |
| Other values (213) | 765 | 1.8% |
| (Missing) | 41462 | 97.6% |
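The values above mix several units (mg/ml, ml/ml, µl/ml, vol%, mM, bare uL), so the column cannot be compared numerically as it stands. A first step is to separate each entry into a number and a unit string. The following is a sketch under stated assumptions: `|` separates layers, `;` separates additives within a layer, and each entry starts with a plain decimal number. Entries such as `nan` that do not match are kept as text with `None` for the number. No unit conversion is attempted.

```python
import re

# A leading decimal number followed by whatever unit text remains.
ENTRY = re.compile(r"^\s*(?P<number>\d+(?:\.\d+)?)\s*(?P<unit>.*?)\s*$")


def parse_concentrations(cell):
    """Return one list per layer, each holding (number, unit) per additive."""
    layers = []
    for layer in cell.split("|"):
        entries = []
        for item in layer.split(";"):
            match = ENTRY.match(item)
            if match:
                entries.append((float(match["number"]), match["unit"]))
            else:
                entries.append((None, item.strip()))  # e.g. "nan"
        layers.append(entries)
    return layers


print(parse_concentrations("9.1 mg/ml; 0.029 ml/ml"))
print(parse_concentrations("nan | 2 uL/mL"))
```

Collecting the distinct unit strings from the parsed output gives a quick inventory of what would need harmonising, including case variants such as `ml` and `mL`.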
Words
| Value | Count | Frequency (%) |
| mg/ml | 438 | 10.4% |
| vol | 345 | 8.2% |
| ul | 216 | 5.1% |
| mm | 164 | 3.9% |
| ml/ml | 154 | 3.7% |
| 9.1 | 147 | 3.5% |
| nan | 144 | 3.4% |
| 28.8 | 125 | 3.0% |
| 107 | 2.5% | |
| µl/ml | 107 | 2.5% |
| Other values (183) | 2271 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 3183 | |
| m | 2062 | 9.6% |
| 0 | 1364 | 6.3% |
| l | 1357 | 6.3% |
| . | 1264 | 5.9% |
| ; | 1077 | 5.0% |
| / | 995 | 4.6% |
| 2 | 946 | 4.4% |
| L | 809 | 3.8% |
| 1 | 715 | 3.3% |
| Other values (26) | 7716 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6084 | |
| Decimal Number | 5959 | |
| Other Punctuation | 3871 | |
| Space Separator | 3183 | |
| Uppercase Letter | 1813 | 8.4% |
| Open Punctuation | 261 | 1.2% |
| Close Punctuation | 261 | 1.2% |
| Math Symbol | 56 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 2062 | |
| l | 1357 | |
| g | 696 | 11.4% |
| u | 523 | 8.6% |
| o | 406 | 6.7% |
| v | 345 | 5.7% |
| n | 288 | 4.7% |
| a | 150 | 2.5% |
| µ | 107 | 1.8% |
| t | 78 | 1.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1364 | |
| 2 | 946 | |
| 1 | 715 | |
| 5 | 711 | |
| 8 | 708 | |
| 3 | 487 | 8.2% |
| 9 | 342 | 5.7% |
| 7 | 295 | 5.0% |
| 4 | 206 | 3.5% |
| 6 | 185 | 3.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 809 | |
| C | 249 | 13.7% |
| A | 247 | 13.6% |
| N | 247 | 13.6% |
| M | 245 | 13.5% |
| K | 14 | 0.8% |
| B | 2 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1264 | |
| ; | 1077 | |
| / | 995 | |
| % | 535 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3183 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 261 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 261 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 56 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13698 | |
| Latin | 7790 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 3183 | |
| 0 | 1364 | |
| . | 1264 | 9.2% |
| ; | 1077 | 7.9% |
| / | 995 | 7.3% |
| 2 | 946 | 6.9% |
| 1 | 715 | 5.2% |
| 5 | 711 | 5.2% |
| 8 | 708 | 5.2% |
| % | 535 | 3.9% |
| Other values (9) | 2200 |
Latin
| Value | Count | Frequency (%) |
| m | 2062 | |
| l | 1357 | |
| L | 809 | 10.4% |
| g | 696 | 8.9% |
| u | 523 | 6.7% |
| o | 406 | 5.2% |
| v | 345 | 4.4% |
| n | 288 | 3.7% |
| C | 249 | 3.2% |
| A | 247 | 3.2% |
| Other values (7) | 808 | 10.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21381 | |
| None | 107 | 0.5% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 3183 | |
| m | 2062 | 9.6% |
| 0 | 1364 | 6.4% |
| l | 1357 | 6.3% |
| . | 1264 | 5.9% |
| ; | 1077 | 5.0% |
| / | 995 | 4.7% |
| 2 | 946 | 4.4% |
| L | 809 | 3.8% |
| 1 | 715 | 3.3% |
| Other values (25) | 7609 |
None
| Value | Count | Frequency (%) |
| µ | 107 |
HTL_deposition_procedure
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 120 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
Top categories
| Value | Count |
|---|---|
| Spin-coating | 35729 |
| Unknown | 2906 |
| Spin-coating \| Spin-coating | 1342 |
| Evaporation | 351 |
| Spray-pyrolys | 256 |
| Other values (115) | 1913 |
Length
| Max length | 91 |
|---|---|
| Median length | 12 |
| Mean length | 12.462268 |
| Min length | 3 |
Characters and Unicode
| Total characters | 529609 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 20 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Spin-coating |
|---|---|
| 2nd row | Spin-coating |
| 3rd row | Spin-coating |
| 4th row | Spin-coating |
| 5th row | Spin-coating |
Common Values
| Value | Count | Frequency (%) |
| Spin-coating | 35729 | 84.1% |
| Unknown | 2906 | 6.8% |
| Spin-coating \| Spin-coating | 1342 | 3.2% |
| Evaporation | 351 | 0.8% |
| Spray-pyrolys | 256 | 0.6% |
| Doctor blading | 181 | 0.4% |
| Spin-coating \| Evaporation | 146 | 0.3% |
| Evaporation \| Evaporation | 141 | 0.3% |
| Spray-coating | 118 | 0.3% |
| Sputtering | 106 | 0.2% |
| Other values (110) | 1221 | 2.9% |
Words
| Value | Count | Frequency (%) |
| spin-coating | 39035 | |
| unknown | 2927 | 6.2% |
| 2167 | 4.6% | |
| evaporation | 950 | 2.0% |
| spray-pyrolys | 285 | 0.6% |
| sputtering | 231 | 0.5% |
| blading | 204 | 0.4% |
| doctor | 204 | 0.4% |
| spray-coating | 163 | 0.3% |
| printing | 88 | 0.2% |
| Other values (58) | 1272 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 89582 | |
| i | 80929 | |
| o | 45819 | |
| a | 42565 | |
| t | 41837 | |
| p | 41578 | |
| g | 40050 | |
| c | 39946 | |
| S | 39797 | |
| - | 39757 | |
| Other values (37) | 27749 | 5.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 437556 | |
| Uppercase Letter | 45036 | 8.5% |
| Dash Punctuation | 39757 | 7.5% |
| Space Separator | 5029 | 0.9% |
| Math Symbol | 2231 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 89582 | |
| i | 80929 | |
| o | 45819 | |
| a | 42565 | |
| t | 41837 | |
| p | 41578 | |
| g | 40050 | |
| c | 39946 | |
| r | 3002 | 0.7% |
| k | 2934 | 0.7% |
| Other values (16) | 9314 | 2.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 39797 | |
| U | 2977 | 6.6% |
| E | 1119 | 2.5% |
| D | 554 | 1.2% |
| L | 154 | 0.3% |
| A | 105 | 0.2% |
| C | 72 | 0.2% |
| R | 55 | 0.1% |
| B | 42 | 0.1% |
| F | 39 | 0.1% |
| Other values (7) | 122 | 0.3% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 2103 | |
| > | 128 | 5.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 39757 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5029 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 482592 | |
| Common | 47017 | 8.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 89582 | |
| i | 80929 | |
| o | 45819 | |
| a | 42565 | |
| t | 41837 | |
| p | 41578 | |
| g | 40050 | |
| c | 39946 | |
| S | 39797 | |
| r | 3002 | 0.6% |
| Other values (33) | 17487 | 3.6% |
Common
| Value | Count | Frequency (%) |
| - | 39757 | |
| (space) | 5029 | 10.7% |
| \| | 2103 | 4.5% |
| > | 128 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 529609 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 89582 | |
| i | 80929 | |
| o | 45819 | |
| a | 42565 | |
| t | 41837 | |
| p | 41578 | |
| g | 40050 | |
| c | 39946 | |
| S | 39797 | |
| - | 39757 | |
| Other values (37) | 27749 | 5.2% |
HTL_deposition_synthesis_atmosphere
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
Top categories
| Value | Count |
|---|---|
| Unknown | 41362 |
| N2 | 634 |
| Air | 310 |
| Ambient | 47 |
| Ar | 30 |
| Other values (14) | 114 |
Length
| Max length | 15 |
|---|---|
| Median length | 7 |
| Mean length | 6.8972633 |
| Min length | 2 |
Characters and Unicode
| Total characters | 293113 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unknown |
|---|---|
| 2nd row | Unknown |
| 3rd row | Unknown |
| 4th row | Unknown |
| 5th row | Unknown |
Common Values
| Value | Count | Frequency (%) |
| Unknown | 41362 | 97.3% |
| N2 | 634 | 1.5% |
| Air | 310 | 0.7% |
| Ambient | 47 | 0.1% |
| Ar | 30 | 0.1% |
| Dry air | 26 | 0.1% |
| N2 \| Vacuum | 18 | < 0.1% |
| Air \| N2 | 13 | < 0.1% |
| Ar; O2 | 10 | < 0.1% |
| N2 \| N2 | 10 | < 0.1% |
| Other values (9) | 37 | 0.1% |
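Missing is reported as 0 for this column, yet 41362 of the 42497 rows hold the literal string `Unknown`. The sketch below recomputes the share of rows without usable information from the counts in the table above. It assumes `Unknown` is a placeholder for "not reported"; the set of placeholder strings and the function name are illustrative choices.

```python
PLACEHOLDERS = {"Unknown"}  # assumption: this string means "not reported"

# Counts copied from the Common Values table for this column.
value_counts = {
    "Unknown": 41362, "N2": 634, "Air": 310, "Ambient": 47, "Ar": 30,
    "Dry air": 26, "N2 | Vacuum": 18, "Air | N2": 13, "Ar; O2": 10,
    "N2 | N2": 10, "Other values (9)": 37,
}


def effective_missing_share(counts, n_missing=0):
    """Share of rows that are either missing or hold a placeholder string."""
    total = sum(counts.values()) + n_missing
    hidden = sum(c for value, c in counts.items() if value in PLACEHOLDERS)
    return (hidden + n_missing) / total


share = effective_missing_share(value_counts)
print(f"{share:.1%}")
```

The same check applies to the other HTL columns that report zero missing cells but list `Unknown` as their most frequent value.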
Words
| Value | Count | Frequency (%) |
| unknown | 41362 | |
| n2 | 691 | 1.6% |
| air | 362 | 0.8% |
| 68 | 0.2% | |
| vacuum | 52 | 0.1% |
| ar | 49 | 0.1% |
| ambient | 47 | 0.1% |
| dry | 26 | 0.1% |
| o2 | 10 | < 0.1% |
| methanol | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 124135 | |
| o | 41364 | 14.1% |
| U | 41362 | 14.1% |
| k | 41362 | 14.1% |
| w | 41362 | 14.1% |
| 2 | 701 | 0.2% |
| N | 691 | 0.2% |
| r | 437 | 0.1% |
| A | 432 | 0.1% |
| i | 409 | 0.1% |
| Other values (17) | 858 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 249581 | |
| Uppercase Letter | 42573 | 14.5% |
| Decimal Number | 701 | 0.2% |
| Space Separator | 172 | 0.1% |
| Math Symbol | 76 | < 0.1% |
| Other Punctuation | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 124135 | |
| o | 41364 | 16.6% |
| k | 41362 | 16.6% |
| w | 41362 | 16.6% |
| r | 437 | 0.2% |
| i | 409 | 0.2% |
| u | 104 | < 0.1% |
| m | 101 | < 0.1% |
| a | 80 | < 0.1% |
| c | 52 | < 0.1% |
| Other values (6) | 175 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 41362 | |
| N | 691 | 1.6% |
| A | 432 | 1.0% |
| V | 52 | 0.1% |
| D | 26 | 0.1% |
| O | 10 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 60 | |
| > | 16 | 21.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 701 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 172 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 292154 | |
| Common | 959 | 0.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 124135 | |
| o | 41364 | 14.2% |
| U | 41362 | 14.2% |
| k | 41362 | 14.2% |
| w | 41362 | 14.2% |
| N | 691 | 0.2% |
| r | 437 | 0.1% |
| A | 432 | 0.1% |
| i | 409 | 0.1% |
| u | 104 | < 0.1% |
| Other values (12) | 496 | 0.2% |
Common
| Value | Count | Frequency (%) |
| 2 | 701 | |
| (space) | 172 | 17.9% |
| \| | 60 | 6.3% |
| > | 16 | 1.7% |
| ; | 10 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 293113 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 124135 | |
| o | 41364 | 14.1% |
| U | 41362 | 14.1% |
| k | 41362 | 14.1% |
| w | 41362 | 14.1% |
| 2 | 701 | 0.2% |
| N | 691 | 0.2% |
| r | 437 | 0.1% |
| A | 432 | 0.1% |
| i | 409 | 0.1% |
| Other values (17) | 858 | 0.3% |
HTL_deposition_solvents
Categorical
HIGH CARDINALITY  HIGH CORRELATION  IMBALANCE 
| Distinct | 51 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
Top categories
| Value | Count |
|---|---|
| Unknown | 40805 |
| Chlorobenzene | 1114 |
| Toluene | 128 |
| Water | 94 |
| Ethanol | 38 |
| Other values (46) | 318 |
Length
| Max length | 37 |
|---|---|
| Median length | 7 |
| Mean length | 7.2251453 |
| Min length | 3 |
Characters and Unicode
| Total characters | 307047 |
|---|---|
| Distinct characters | 45 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unknown |
|---|---|
| 2nd row | Unknown |
| 3rd row | Unknown |
| 4th row | Unknown |
| 5th row | Unknown |
Common Values
| Value | Count | Frequency (%) |
| Unknown | 40805 | 96.0% |
| Chlorobenzene | 1114 | 2.6% |
| Toluene | 128 | 0.3% |
| Water | 94 | 0.2% |
| Ethanol | 38 | 0.1% |
| Toluene \| Methanol | 31 | 0.1% |
| IPA; Water | 22 | 0.1% |
| Chlorobenzene \| none | 21 | < 0.1% |
| IPA | 17 | < 0.1% |
| Diethyl sulfide | 16 | < 0.1% |
| Other values (41) | 211 | 0.5% |
Words
| Value | Count | Frequency (%) |
| unknown | 40822 | |
| chlorobenzene | 1215 | 2.8% |
| toluene | 161 | 0.4% |
| 150 | 0.3% | |
| water | 146 | 0.3% |
| ipa | 69 | 0.2% |
| ethanol | 61 | 0.1% |
| none | 53 | 0.1% |
| methanol | 40 | 0.1% |
| acetonitrile | 21 | < 0.1% |
| Other values (22) | 179 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 125382 | |
| o | 43759 | 14.3% |
| w | 40822 | 13.3% |
| U | 40822 | 13.3% |
| k | 40822 | 13.3% |
| e | 4437 | 1.4% |
| l | 1669 | 0.5% |
| r | 1425 | 0.5% |
| h | 1414 | 0.5% |
| C | 1227 | 0.4% |
| Other values (35) | 5268 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 263514 | |
| Uppercase Letter | 42835 | 14.0% |
| Space Separator | 420 | 0.1% |
| Math Symbol | 152 | < 0.1% |
| Other Punctuation | 74 | < 0.1% |
| Decimal Number | 27 | < 0.1% |
| Dash Punctuation | 25 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 125382 | |
| o | 43759 | 16.6% |
| w | 40822 | 15.5% |
| k | 40822 | 15.5% |
| e | 4437 | 1.7% |
| l | 1669 | 0.6% |
| r | 1425 | 0.5% |
| h | 1414 | 0.5% |
| z | 1224 | 0.5% |
| b | 1224 | 0.5% |
| Other values (13) | 1336 | 0.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 40822 | |
| C | 1227 | 2.9% |
| T | 186 | 0.4% |
| W | 146 | 0.3% |
| A | 84 | 0.2% |
| E | 77 | 0.2% |
| P | 69 | 0.2% |
| I | 69 | 0.2% |
| M | 67 | 0.2% |
| D | 36 | 0.1% |
| Other values (4) | 52 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 148 | |
| > | 4 | 2.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 72 | |
| , | 2 | 2.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 25 | |
| 1 | 2 | 7.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 420 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 25 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 306349 | |
| Common | 698 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 125382 | |
| o | 43759 | 14.3% |
| w | 40822 | 13.3% |
| U | 40822 | 13.3% |
| k | 40822 | 13.3% |
| e | 4437 | 1.4% |
| l | 1669 | 0.5% |
| r | 1425 | 0.5% |
| h | 1414 | 0.5% |
| C | 1227 | 0.4% |
| Other values (27) | 4570 | 1.5% |
Common
| Value | Count | Frequency (%) |
| (space) | 420 | 60.2% |
| \| | 148 | 21.2% |
| ; | 72 | 10.3% |
| 2 | 25 | 3.6% |
| - | 25 | 3.6% |
| > | 4 | 0.6% |
| 1 | 2 | 0.3% |
| , | 2 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 307047 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 125382 | |
| o | 43759 | 14.3% |
| w | 40822 | 13.3% |
| U | 40822 | 13.3% |
| k | 40822 | 13.3% |
| e | 4437 | 1.4% |
| l | 1669 | 0.5% |
| r | 1425 | 0.5% |
| h | 1414 | 0.5% |
| C | 1227 | 0.4% |
| Other values (35) | 5268 | 1.7% |
HTL_deposition_solvents_mixing_ratios
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 14 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 41988 |
| Missing (%) | 98.8% |
| Memory size | 332.1 KiB |
| Value | Count |
|---|---|
| 1 | 411 |
| 1 \| nan | 26 |
| 1 \| 1 | 20 |
| 5; 1 | 15 |
| 1; 8 | 10 |
| Other values (9) | 27 |
Length
| Max length | 14 |
|---|---|
| Median length | 1 |
| Mean length | 1.9056974 |
| Min length | 1 |
Characters and Unicode
| Total characters | 970 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 411 | 1.0% |
| 1 \| nan | 26 | 0.1% |
| 1 \| 1 | 20 | < 0.1% |
| 5; 1 | 15 | < 0.1% |
| 1; 8 | 10 | < 0.1% |
| 1; 0.012 | 5 | < 0.1% |
| 1; 0.1 | 5 | < 0.1% |
| 1; 1 | 5 | < 0.1% |
| 0.1; 51 | 4 | < 0.1% |
| nan \| 1 | 3 | < 0.1% |
| Other values (4) | 5 | < 0.1% |
| (Missing) | 41988 | 98.8% |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| 1 | 532 | 80.6% |
| | 52 | 7.9% |
| nan | 30 | 4.5% |
| 5 | 15 | 2.3% |
| 8 | 10 | 1.5% |
| 0.1 | 9 | 1.4% |
| 0.012 | 5 | 0.8% |
| 51 | 4 | 0.6% |
| 0.006 | 2 | 0.3% |
| 3 | 1 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 550 | |
| (space) | 151 | 15.6% |
| n | 60 | 6.2% |
| \| | 50 | 5.2% |
| ; | 47 | 4.8% |
| a | 30 | 3.1% |
| 0 | 25 | 2.6% |
| 5 | 19 | 2.0% |
| . | 16 | 1.6% |
| 8 | 10 | 1.0% |
| Other values (4) | 12 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 612 | |
| Space Separator | 151 | 15.6% |
| Lowercase Letter | 90 | 9.3% |
| Other Punctuation | 63 | 6.5% |
| Math Symbol | 54 | 5.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 550 | |
| 0 | 25 | 4.1% |
| 5 | 19 | 3.1% |
| 8 | 10 | 1.6% |
| 2 | 5 | 0.8% |
| 6 | 2 | 0.3% |
| 3 | 1 | 0.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 60 | |
| a | 30 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 50 | 92.6% |
| > | 4 | 7.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 47 | |
| . | 16 | 25.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 151 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 880 | |
| Latin | 90 | 9.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 550 | |
| (space) | 151 | 17.2% |
| \| | 50 | 5.7% |
| ; | 47 | 5.3% |
| 0 | 25 | 2.8% |
| 5 | 19 | 2.2% |
| . | 16 | 1.8% |
| 8 | 10 | 1.1% |
| 2 | 5 | 0.6% |
| > | 4 | 0.5% |
| Other values (2) | 3 | 0.3% |
Latin
| Value | Count | Frequency (%) |
| n | 60 | |
| a | 30 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 970 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 550 | |
| (space) | 151 | 15.6% |
| n | 60 | 6.2% |
| \| | 50 | 5.2% |
| ; | 47 | 4.8% |
| a | 30 | 3.1% |
| 0 | 25 | 2.6% |
| 5 | 19 | 2.0% |
| . | 16 | 1.6% |
| 8 | 10 | 1.0% |
| Other values (4) | 12 | 1.2% |
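The values above mix two delimiters (`|` and `;`) with literal `nan` tokens, which is why the column is profiled as Categorical rather than numeric. A minimal sketch of turning one cell into numbers follows. It assumes that `|` separates layers and `;` separates solvents within a layer; the report itself does not state this, and `parse_ratios` is a hypothetical helper name. Tokens that cannot be read as a number (including the `>>` marker that appears in a few cells) are returned as `None`.

```python
import math


def parse_ratios(cell):
    """Split a mixing-ratio cell into per-layer lists of floats.

    Assumption (not stated in the report): '|' separates layers and
    ';' separates the solvents of one layer.
    """
    layers = []
    for layer in cell.split("|"):
        ratios = []
        for token in layer.split(";"):
            try:
                value = float(token.strip())
            except ValueError:
                value = None  # unparseable token, e.g. empty or '1 >> 2'
            if value is not None and math.isnan(value):
                value = None  # literal 'nan' is treated as missing
            ratios.append(value)
        layers.append(ratios)
    return layers


print(parse_ratios("5; 1"))
print(parse_ratios("1 | nan"))
```

Returning nested lists keeps the layer structure intact, so a later step can decide whether to flatten, normalise, or drop layers with missing ratios.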
Backcontact_stack_sequence
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 289 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
| Value | Count |
|---|---|
| Au | 20531 |
| Ag | 12985 |
| Al | 3138 |
| Carbon | 2132 |
| MoO3 \| Ag | 693 |
| Other values (284) | 3018 |
Length
| Max length | 49 |
|---|---|
| Median length | 2 |
| Mean length | 2.7728546 |
| Min length | 1 |
Characters and Unicode
| Total characters | 117838 |
|---|---|
| Distinct characters | 63 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 86 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Au |
|---|---|
| 2nd row | Au |
| 3rd row | Au |
| 4th row | Au |
| 5th row | Au |
Common Values
| Value | Count | Frequency (%) |
| Au | 20531 | 48.3% |
| Ag | 12985 | 30.6% |
| Al | 3138 | 7.4% |
| Carbon | 2132 | 5.0% |
| MoO3 \| Ag | 693 | 1.6% |
| Cu | 536 | 1.3% |
| Ca \| Al | 307 | 0.7% |
| MoO3 \| Al | 120 | 0.3% |
| MoO3 \| Au | 117 | 0.3% |
| AgAl | 77 | 0.2% |
| Other values (279) | 1861 | 4.4% |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| au | 20914 | 43.3% |
| ag | 14131 | 29.2% |
| al | 3686 | 7.6% |
| | 2799 | 5.8% |
| carbon | 2383 | 4.9% |
| moo3 | 1105 | 2.3% |
| cu | 599 | 1.2% |
| ca | 337 | 0.7% |
| moox | 212 | 0.4% |
| ito | 208 | 0.4% |
| Other values (170) | 1978 | 4.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 39203 | |
| u | 21578 | |
| g | 14402 | 12.2% |
| (space) | 5855 | 5.0% |
| l | 3911 | 3.3% |
| o | 3888 | 3.3% |
| C | 3615 | 3.1% |
| a | 3144 | 2.7% |
| n | 2923 | 2.5% |
| \| | 2799 | 2.4% |
| Other values (53) | 16520 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 57168 | |
| Uppercase Letter | 50022 | |
| Space Separator | 5855 | 5.0% |
| Math Symbol | 2825 | 2.4% |
| Decimal Number | 1344 | 1.1% |
| Dash Punctuation | 340 | 0.3% |
| Other Punctuation | 282 | 0.2% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 21578 | |
| g | 14402 | |
| l | 3911 | 6.8% |
| o | 3888 | 6.8% |
| a | 3144 | 5.5% |
| n | 2923 | 5.1% |
| r | 2723 | 4.8% |
| b | 2558 | 4.5% |
| t | 323 | 0.6% |
| e | 321 | 0.6% |
| Other values (14) | 1397 | 2.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 39203 | |
| C | 3615 | 7.2% |
| O | 2086 | 4.2% |
| M | 1556 | 3.1% |
| T | 685 | 1.4% |
| P | 523 | 1.0% |
| S | 521 | 1.0% |
| I | 317 | 0.6% |
| G | 296 | 0.6% |
| F | 246 | 0.5% |
| Other values (12) | 974 | 1.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 1183 | |
| 2 | 109 | 8.1% |
| 0 | 18 | 1.3% |
| 6 | 13 | 1.0% |
| 4 | 9 | 0.7% |
| 5 | 6 | 0.4% |
| 1 | 6 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 131 | |
| ; | 120 | |
| @ | 27 | 9.6% |
| ' | 4 | 1.4% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 2799 | 99.1% |
| ∣ | 26 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5855 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 340 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 107190 | |
| Common | 10648 | 9.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 39203 | |
| u | 21578 | |
| g | 14402 | 13.4% |
| l | 3911 | 3.6% |
| o | 3888 | 3.6% |
| C | 3615 | 3.4% |
| a | 3144 | 2.9% |
| n | 2923 | 2.7% |
| r | 2723 | 2.5% |
| b | 2558 | 2.4% |
| Other values (36) | 9245 | 8.6% |
Common
| Value | Count | Frequency (%) |
| (space) | 5855 | 55.0% |
| \| | 2799 | 26.3% |
| 3 | 1183 | 11.1% |
| - | 340 | 3.2% |
| : | 131 | 1.2% |
| ; | 120 | 1.1% |
| 2 | 109 | 1.0% |
| @ | 27 | 0.3% |
| ∣ | 26 | 0.2% |
| 0 | 18 | 0.2% |
| Other values (7) | 40 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 117812 | |
| Math Operators | 26 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 39203 | |
| u | 21578 | |
| g | 14402 | 12.2% |
| (space) | 5855 | 5.0% |
| l | 3911 | 3.3% |
| o | 3888 | 3.3% |
| C | 3615 | 3.1% |
| a | 3144 | 2.7% |
| n | 2923 | 2.5% |
| \| | 2799 | 2.4% |
| Other values (52) | 16494 |
Math Operators
| Value | Count | Frequency (%) |
| ∣ | 26 |
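The Math Operators block above shows 26 occurrences of `∣` (U+2223, DIVIDES) alongside 2799 ordinary ASCII `|` characters. A plausible reading, which is an assumption rather than something the report confirms, is that `∣` is a mistyped layer separator. Under that assumption, a small hypothetical helper (`split_stack`) can normalise both characters before splitting a stack sequence into layers:

```python
def split_stack(cell):
    """Split a stack-sequence string into a list of layer names.

    Assumption: U+2223 ('∣') is a typo for the ASCII pipe separator.
    """
    normalized = cell.replace("\u2223", "|")
    return [layer.strip() for layer in normalized.split("|")]


print(split_stack("MoO3 | Ag"))
print(split_stack("Ca \u2223 Al"))
```

Without the normalisation step, a cell written with `∣` would be counted as a single, distinct category and would inflate the distinct-value count.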
Backcontact_thickness_list
Categorical
HIGH CARDINALITY  IMBALANCE  MISSING 
| Distinct | 384 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 7976 |
| Missing (%) | 18.8% |
| Memory size | 332.1 KiB |
| Value | Count |
|---|---|
| 100.0 | 12528 |
| 80.0 | 8638 |
| 60.0 | 2500 |
| 70.0 | 1817 |
| 120.0 | 1488 |
| Other values (379) | 7550 |
Length
| Max length | 32 |
|---|---|
| Median length | 31 |
| Mean length | 4.9911938 |
| Min length | 3 |
Characters and Unicode
| Total characters | 172301 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 126 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 90.0 |
|---|---|
| 2nd row | 90.0 |
| 3rd row | 90.0 |
| 4th row | 90.0 |
| 5th row | 90.0 |
Common Values
| Value | Count | Frequency (%) |
| 100.0 | 12528 | 29.5% |
| 80.0 | 8638 | 20.3% |
| 60.0 | 2500 | 5.9% |
| 70.0 | 1817 | 4.3% |
| 120.0 | 1488 | 3.5% |
| 150.0 | 1391 | 3.3% |
| 50.0 | 875 | 2.1% |
| 90.0 | 677 | 1.6% |
| 10000.0 | 385 | 0.9% |
| 200.0 | 338 | 0.8% |
| Other values (374) | 3884 | 9.1% |
| (Missing) | 7976 | 18.8% |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| 100.0 | 13421 | 34.6% |
| 80.0 | 8892 | 22.9% |
| 60.0 | 2538 | 6.5% |
| | 2134 | 5.5% |
| 70.0 | 1887 | 4.9% |
| 120.0 | 1620 | 4.2% |
| 150.0 | 1461 | 3.8% |
| 50.0 | 948 | 2.4% |
| 90.0 | 697 | 1.8% |
| 10.0 | 592 | 1.5% |
| Other values (174) | 4599 | 11.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 87223 | |
| . | 36411 | |
| 1 | 18828 | 10.9% |
| 8 | 9311 | 5.4% |
| (space) | 4268 | 2.5% |
| 5 | 3779 | 2.2% |
| 6 | 2876 | 1.7% |
| 2 | 2728 | 1.6% |
| 7 | 2338 | 1.4% |
| \| | 2134 | 1.2% |
| Other values (6) | 2405 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 128772 | |
| Other Punctuation | 36411 | 21.1% |
| Space Separator | 4268 | 2.5% |
| Math Symbol | 2134 | 1.2% |
| Lowercase Letter | 708 | 0.4% |
| Dash Punctuation | 8 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 87223 | |
| 1 | 18828 | 14.6% |
| 8 | 9311 | 7.2% |
| 5 | 3779 | 2.9% |
| 6 | 2876 | 2.2% |
| 2 | 2728 | 2.1% |
| 7 | 2338 | 1.8% |
| 9 | 806 | 0.6% |
| 3 | 464 | 0.4% |
| 4 | 419 | 0.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 472 | |
| a | 236 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 36411 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4268 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 2134 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 171593 | |
| Latin | 708 | 0.4% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 87223 | |
| . | 36411 | |
| 1 | 18828 | 11.0% |
| 8 | 9311 | 5.4% |
| (space) | 4268 | 2.5% |
| 5 | 3779 | 2.2% |
| 6 | 2876 | 1.7% |
| 2 | 2728 | 1.6% |
| 7 | 2338 | 1.4% |
| \| | 2134 | 1.2% |
| Other values (4) | 1697 | 1.0% |
Latin
| Value | Count | Frequency (%) |
| n | 472 | |
| a | 236 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 172301 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 87223 | |
| . | 36411 | |
| 1 | 18828 | 10.9% |
| 8 | 9311 | 5.4% |
| (space) | 4268 | 2.5% |
| 5 | 3779 | 2.2% |
| 6 | 2876 | 1.7% |
| 2 | 2728 | 1.6% |
| 7 | 2338 | 1.4% |
| \| | 2134 | 1.2% |
| Other values (6) | 2405 | 1.4% |
Backcontact_deposition_procedure
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 113 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
| Value | Count |
|---|---|
| Evaporation | 36845 |
| Evaporation \| Evaporation | 1633 |
| Doctor blading | 1187 |
| Screen printing | 881 |
| Sputtering | 269 |
| Other values (108) | 1682 |
Length
| Max length | 95 |
|---|---|
| Median length | 11 |
| Mean length | 12.009036 |
| Min length | 3 |
Characters and Unicode
| Total characters | 510348 |
|---|---|
| Distinct characters | 42 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 23 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Evaporation |
|---|---|
| 2nd row | Evaporation |
| 3rd row | Evaporation |
| 4th row | Evaporation |
| 5th row | Evaporation |
Common Values
| Value | Count | Frequency (%) |
| Evaporation | 36845 | 86.7% |
| Evaporation \| Evaporation | 1633 | 3.8% |
| Doctor blading | 1187 | 2.8% |
| Screen printing | 881 | 2.1% |
| Sputtering | 269 | 0.6% |
| Unknown | 269 | 0.6% |
| Lamination | 230 | 0.5% |
| Sandwiching | 138 | 0.3% |
| Magnetron sputtering | 106 | 0.2% |
| Evaporation \| Evaporation \| Evaporation | 81 | 0.2% |
| Other values (103) | 858 | 2.0% |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| evaporation | 40822 | 81.5% |
| | 2426 | 4.8% |
| doctor | 1239 | 2.5% |
| blading | 1239 | 2.5% |
| printing | 939 | 1.9% |
| screen | 911 | 1.8% |
| sputtering | 610 | 1.2% |
| lamination | 307 | 0.6% |
| unknown | 297 | 0.6% |
| sandwiching | 201 | 0.4% |
| Other values (43) | 1085 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 85202 | |
| a | 84571 | |
| n | 48512 | |
| i | 46301 | |
| t | 45145 | |
| r | 45020 | |
| p | 42756 | |
| v | 40836 | |
| E | 40822 | |
| (space) | 7581 | 1.5% |
| Other values (32) | 23602 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 454809 | |
| Uppercase Letter | 45115 | 8.8% |
| Space Separator | 7581 | 1.5% |
| Math Symbol | 2483 | 0.5% |
| Dash Punctuation | 360 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 85202 | |
| a | 84571 | |
| n | 48512 | |
| i | 46301 | |
| t | 45145 | |
| r | 45020 | |
| p | 42756 | |
| v | 40836 | |
| g | 3598 | 0.8% |
| e | 3069 | 0.7% |
| Other values (13) | 9799 | 2.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 40822 | |
| S | 1801 | 4.0% |
| D | 1384 | 3.1% |
| L | 311 | 0.7% |
| U | 299 | 0.7% |
| M | 168 | 0.4% |
| C | 102 | 0.2% |
| B | 84 | 0.2% |
| P | 51 | 0.1% |
| I | 29 | 0.1% |
| Other values (5) | 64 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 2369 | 95.4% |
| > | 114 | 4.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7581 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 360 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 499924 | |
| Common | 10424 | 2.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 85202 | |
| a | 84571 | |
| n | 48512 | |
| i | 46301 | |
| t | 45145 | |
| r | 45020 | |
| p | 42756 | |
| v | 40836 | |
| E | 40822 | |
| g | 3598 | 0.7% |
| Other values (28) | 17161 | 3.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 7581 | 72.7% |
| \| | 2369 | 22.7% |
| - | 360 | 3.5% |
| > | 114 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 510348 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 85202 | |
| a | 84571 | |
| n | 48512 | |
| i | 46301 | |
| t | 45145 | |
| r | 45020 | |
| p | 42756 | |
| v | 40836 | |
| E | 40822 | |
| (space) | 7581 | 1.5% |
| Other values (32) | 23602 | 4.6% |
Add_lay_front_stack_sequence
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
| Value | Count |
|---|---|
| Unknown | 42363 |
| MgF2 | 72 |
| Ag-np | 10 |
| Eu(TTA)2(Phen)MAA | 10 |
| NaYF4:Eu-np | 8 |
| Other values (16) | 34 |
Length
| Max length | 28 |
|---|---|
| Median length | 7 |
| Mean length | 6.9998588 |
| Min length | 3 |
Characters and Unicode
| Total characters | 297473 |
|---|---|
| Distinct characters | 52 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unknown |
|---|---|
| 2nd row | Unknown |
| 3rd row | Unknown |
| 4th row | Unknown |
| 5th row | Unknown |
Common Values
| Value | Count | Frequency (%) |
| Unknown | 42363 | 99.7% |
| MgF2 | 72 | 0.2% |
| Ag-np | 10 | < 0.1% |
| Eu(TTA)2(Phen)MAA | 10 | < 0.1% |
| NaYF4:Eu-np | 8 | < 0.1% |
| CdSeS-QDs | 4 | < 0.1% |
| Mn:CsPbCl3-QDs | 4 | < 0.1% |
| N-Graphene-QDs | 3 | < 0.1% |
| Mica | 3 | < 0.1% |
| NaF | 3 | < 0.1% |
| Other values (11) | 17 | < 0.1% |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| unknown | 42363 | 99.7% |
| mgf2 | 72 | 0.2% |
| ag-np | 10 | < 0.1% |
| eu(tta)2(phen)maa | 10 | < 0.1% |
| nayf4:eu-np | 8 | < 0.1% |
| cdses-qds | 4 | < 0.1% |
| mn:cspbcl3-qds | 4 | < 0.1% |
| y2o3:eu3 | 4 | < 0.1% |
| n-graphene-qds | 3 | < 0.1% |
| mica | 3 | < 0.1% |
| Other values (16) | 27 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 127131 | |
| o | 42370 | 14.2% |
| U | 42363 | 14.2% |
| k | 42363 | 14.2% |
| w | 42363 | 14.2% |
| M | 93 | < 0.1% |
| 2 | 86 | < 0.1% |
| F | 85 | < 0.1% |
| g | 83 | < 0.1% |
| A | 44 | < 0.1% |
| Other values (42) | 492 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 254511 | |
| Uppercase Letter | 42746 | 14.4% |
| Decimal Number | 106 | < 0.1% |
| Dash Punctuation | 40 | < 0.1% |
| Close Punctuation | 20 | < 0.1% |
| Open Punctuation | 20 | < 0.1% |
| Other Punctuation | 16 | < 0.1% |
| Space Separator | 11 | < 0.1% |
| Math Symbol | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 127131 | |
| o | 42370 | 16.6% |
| k | 42363 | 16.6% |
| w | 42363 | 16.6% |
| g | 83 | < 0.1% |
| e | 32 | < 0.1% |
| u | 27 | < 0.1% |
| p | 26 | < 0.1% |
| s | 20 | < 0.1% |
| a | 19 | < 0.1% |
| Other values (13) | 77 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 42363 | |
| M | 93 | 0.2% |
| F | 85 | 0.2% |
| A | 44 | 0.1% |
| E | 23 | 0.1% |
| P | 20 | < 0.1% |
| T | 20 | < 0.1% |
| D | 17 | < 0.1% |
| N | 15 | < 0.1% |
| Q | 13 | < 0.1% |
| Other values (10) | 53 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 86 | |
| 3 | 12 | 11.3% |
| 4 | 8 | 7.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 40 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 20 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 20 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 16 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 11 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 297257 | |
| Common | 216 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 127131 | |
| o | 42370 | 14.3% |
| U | 42363 | 14.3% |
| k | 42363 | 14.3% |
| w | 42363 | 14.3% |
| M | 93 | < 0.1% |
| F | 85 | < 0.1% |
| g | 83 | < 0.1% |
| A | 44 | < 0.1% |
| e | 32 | < 0.1% |
| Other values (33) | 330 | 0.1% |
Common
| Value | Count | Frequency (%) |
| 2 | 86 | |
| - | 40 | |
| ) | 20 | 9.3% |
| ( | 20 | 9.3% |
| : | 16 | 7.4% |
| 3 | 12 | 5.6% |
| (space) | 11 | 5.1% |
| 4 | 8 | 3.7% |
| \| | 3 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 297473 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 127131 | |
| o | 42370 | 14.2% |
| U | 42363 | 14.2% |
| k | 42363 | 14.2% |
| w | 42363 | 14.2% |
| M | 93 | < 0.1% |
| 2 | 86 | < 0.1% |
| F | 85 | < 0.1% |
| g | 83 | < 0.1% |
| A | 44 | < 0.1% |
| Other values (42) | 492 | 0.2% |
Add_lay_back
Boolean
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.6 KiB |
| Value | Count |
|---|---|
| False | 42469 |
| True | 28 |
| Value | Count | Frequency (%) |
| False | 42469 | 99.9% |
| True | 28 | 0.1% |
Encapsulation
Boolean
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.6 KiB |
| Value | Count |
|---|---|
| False | 40754 |
| True | 1743 |
| Value | Count | Frequency (%) |
| False | 40754 | 95.9% |
| True | 1743 | 4.1% |
Encapsulation_stack_sequence
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 118 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 332.1 KiB |
| Value | Count |
|---|---|
| Unknown | 41286 |
| SLG | 277 |
| Cover glass-QDs | 121 |
| Epoxy | 94 |
| UV-curable epoxy | 83 |
| Other values (113) | 636 |
Length
| Max length | 67 |
|---|---|
| Median length | 7 |
| Mean length | 7.135186 |
| Min length | 3 |
Characters and Unicode
| Total characters | 303224 |
|---|---|
| Distinct characters | 64 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 33 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Unknown |
|---|---|
| 2nd row | Unknown |
| 3rd row | Unknown |
| 4th row | Unknown |
| 5th row | Unknown |
Common Values
| Value | Count | Frequency (%) |
| Unknown | 41286 | 97.2% |
| SLG | 277 | 0.7% |
| Cover glass-QDs | 121 | 0.3% |
| Epoxy | 94 | 0.2% |
| UV-curable epoxy | 83 | 0.2% |
| PMMA | 58 | 0.1% |
| UV-curable epoxy \| Cover glass-QDs | 29 | 0.1% |
| Polyisobutene | 26 | 0.1% |
| Polymer | 25 | 0.1% |
| UV-glue \| SLG | 25 | 0.1% |
| Other values (108) | 473 | 1.1% |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| unknown | 41286 | 94.5% |
| slg | 416 | 1.0% |
| epoxy | 297 | 0.7% |
| | 233 | 0.5% |
| cover | 174 | 0.4% |
| glass-qds | 172 | 0.4% |
| uv-curable | 118 | 0.3% |
| pmma | 59 | 0.1% |
| uv-glue | 54 | 0.1% |
| surlyn | 49 | 0.1% |
| Other values (135) | 833 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 124143 | |
| o | 42008 | 13.9% |
| U | 41533 | 13.7% |
| w | 41313 | 13.6% |
| k | 41287 | 13.6% |
| (space) | 1194 | 0.4% |
| e | 972 | 0.3% |
| s | 775 | 0.3% |
| l | 763 | 0.3% |
| r | 647 | 0.2% |
| Other values (54) | 8589 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 256039 | |
| Uppercase Letter | 44846 | 14.8% |
| Space Separator | 1194 | 0.4% |
| Dash Punctuation | 407 | 0.1% |
| Decimal Number | 263 | 0.1% |
| Math Symbol | 233 | 0.1% |
| Close Punctuation | 85 | < 0.1% |
| Open Punctuation | 85 | < 0.1% |
| Other Punctuation | 72 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 124143 | |
| o | 42008 | 16.4% |
| w | 41313 | 16.1% |
| k | 41287 | 16.1% |
| e | 972 | 0.4% |
| s | 775 | 0.3% |
| l | 763 | 0.3% |
| r | 647 | 0.3% |
| a | 615 | 0.2% |
| y | 505 | 0.2% |
| Other values (13) | 3011 | 1.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 41533 | |
| S | 507 | 1.1% |
| L | 428 | 1.0% |
| G | 428 | 1.0% |
| V | 286 | 0.6% |
| P | 237 | 0.5% |
| C | 219 | 0.5% |
| D | 198 | 0.4% |
| E | 192 | 0.4% |
| Q | 172 | 0.4% |
| Other values (12) | 646 | 1.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 91 | |
| 2 | 69 | |
| 0 | 29 | 11.0% |
| 5 | 27 | 10.3% |
| 1 | 22 | 8.4% |
| 4 | 14 | 5.3% |
| 9 | 9 | 3.4% |
| 6 | 1 | 0.4% |
| 8 | 1 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 35 | |
| ; | 26 | |
| : | 6 | 8.3% |
| ' | 3 | 4.2% |
| . | 2 | 2.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1194 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 407 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 233 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 85 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 85 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 300885 | |
| Common | 2339 | 0.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 124143 | |
| o | 42008 | 14.0% |
| U | 41533 | 13.8% |
| w | 41313 | 13.7% |
| k | 41287 | 13.7% |
| e | 972 | 0.3% |
| s | 775 | 0.3% |
| l | 763 | 0.3% |
| r | 647 | 0.2% |
| a | 615 | 0.2% |
| Other values (35) | 6829 | 2.3% |
Common
| Value | Count | Frequency (%) |
| (space) | 1194 | 51.0% |
| - | 407 | 17.4% |
| \| | 233 | 10.0% |
| 3 | 91 | 3.9% |
| ) | 85 | 3.6% |
| ( | 85 | 3.6% |
| 2 | 69 | 2.9% |
| , | 35 | 1.5% |
| 0 | 29 | 1.2% |
| 5 | 27 | 1.2% |
| Other values (9) | 84 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 303224 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 124143 | |
| o | 42008 | 13.9% |
| U | 41533 | 13.7% |
| w | 41313 | 13.6% |
| k | 41287 | 13.6% |
| (space) | 1194 | 0.4% |
| e | 972 | 0.3% |
| s | 775 | 0.3% |
| l | 763 | 0.3% |
| r | 647 | 0.2% |
| Other values (54) | 8589 | 2.8% |
JV_light_intensity
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 110 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 65 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 99.948904 |
| Minimum | 0 |
|---|---|
| Maximum | 1800 |
| Zeros | 8 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 332.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 100 |
| Q1 | 100 |
| median | 100 |
| Q3 | 100 |
| 95-th percentile | 100 |
| Maximum | 1800 |
| Range | 1800 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 19.274965 |
|---|---|
| Coefficient of variation (CV) | 0.19284819 |
| Kurtosis | 5808.1874 |
| Mean | 99.948904 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 71.515835 |
| Sum | 4241031.9 |
| Variance | 371.52428 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 100 | 42021 | 98.9% |
| 200 | 25 | 0.1% |
| 50 | 22 | 0.1% |
| 10 | 19 | < 0.1% |
| 0.146 | 16 | < 0.1% |
| 98 | 16 | < 0.1% |
| 95 | 15 | < 0.1% |
| 80 | 12 | < 0.1% |
| 95.6 | 12 | < 0.1% |
| 81 | 11 | < 0.1% |
| Other values (100) | 263 | 0.6% |
| (Missing) | 65 | 0.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 8 | < 0.1% |
| 0.000146 | 2 | < 0.1% |
| 0.0058 | 5 | < 0.1% |
| 0.03 | 7 | < 0.1% |
| 0.062 | 1 | < 0.1% |
| 0.1 | 1 | < 0.1% |
| 0.14 | 1 | < 0.1% |
| 0.146 | 16 | < 0.1% |
| 0.1516 | 3 | < 0.1% |
| 0.2754 | 7 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 1800 | 2 | < 0.1% |
| 1600 | 3 | < 0.1% |
| 1300 | 1 | < 0.1% |
| 500 | 1 | < 0.1% |
| 480 | 1 | < 0.1% |
| 300 | 1 | < 0.1% |
| 200 | 25 | 0.1% |
| 150 | 3 | < 0.1% |
| 137 | 4 | < 0.1% |
| 120 | 1 | < 0.1% |
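The descriptive statistics for JV_light_intensity are tied together by simple identities: the coefficient of variation is the standard deviation divided by the mean, and the variance is the standard deviation squared. The sketch below re-derives both from the figures printed in the tables and also computes the share of non-missing rows equal to 100, which explains why every quantile from the 5th to the 95th percentile is 100 and the IQR is 0. The variable names are illustrative only.

```python
# Figures copied from the JV_light_intensity tables.
mean = 99.948904
std = 19.274965
n_total, n_missing, n_at_100 = 42497, 65, 42021

cv = std / mean                 # coefficient of variation
variance = std ** 2             # variance is the squared standard deviation
share_at_100 = n_at_100 / (n_total - n_missing)  # share among non-missing rows

print(f"CV = {cv:.8f}")
print(f"variance = {variance:.5f}")
print(f"share of non-missing rows at 100 = {share_at_100:.1%}")
```

With almost all of the mass on a single value, the extreme skewness and kurtosis are driven by a handful of rows at 1300 to 1800 rather than by any spread in the bulk of the data.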
JV_light_spectra
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2491 |
| Missing (%) | 5.9% |
| Memory size | 332.1 KiB |
| Value | Count |
|---|---|
| AM 1.5 | 39906 |
| Am 1.5 | 56 |
| Indoor light | 43 |
| Monochromatic | 1 |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 6.006624 |
| Min length | 6 |
Characters and Unicode
| Total characters | 240301 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | AM 1.5 |
|---|---|
| 2nd row | AM 1.5 |
| 3rd row | AM 1.5 |
| 4th row | AM 1.5 |
| 5th row | AM 1.5 |
Common Values
| Value | Count | Frequency (%) |
| AM 1.5 | 39906 | 93.9% |
| Am 1.5 | 56 | 0.1% |
| Indoor light | 43 | 0.1% |
| Monochromatic | 1 | < 0.1% |
| (Missing) | 2491 | 5.9% |
Length (histogram omitted)
Common Values (plot omitted)
Words
| Value | Count | Frequency (%) |
| am | 39962 | 49.9% |
| 1.5 | 39962 | 49.9% |
| indoor | 43 | 0.1% |
| light | 43 | 0.1% |
| monochromatic | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 40005 | 16.6% |
| A | 39962 | |
| 1 | 39962 | |
| . | 39962 | |
| 5 | 39962 | |
| M | 39907 | |
| o | 89 | < 0.1% |
| m | 57 | < 0.1% |
| t | 44 | < 0.1% |
| n | 44 | < 0.1% |
| Other values (9) | 307 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 79924 | |
| Uppercase Letter | 79912 | |
| Space Separator | 40005 | |
| Other Punctuation | 39962 | |
| Lowercase Letter | 498 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 89 | |
| m | 57 | |
| t | 44 | |
| n | 44 | |
| r | 44 | |
| h | 44 | |
| i | 44 | |
| g | 43 | |
| d | 43 | |
| l | 43 | |
| Other values (2) | 3 | 0.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 39962 | |
| M | 39907 | |
| I | 43 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 39962 | |
| 5 | 39962 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 40005 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 39962 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 159891 | |
| Latin | 80410 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 39962 | |
| M | 39907 | |
| o | 89 | 0.1% |
| m | 57 | 0.1% |
| t | 44 | 0.1% |
| n | 44 | 0.1% |
| r | 44 | 0.1% |
| h | 44 | 0.1% |
| i | 44 | 0.1% |
| g | 43 | 0.1% |
| Other values (5) | 132 | 0.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 40005 | 25.0% |
| 1 | 39962 | |
| . | 39962 | |
| 5 | 39962 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 240301 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 40005 | 16.6% |
| A | 39962 | |
| 1 | 39962 | |
| . | 39962 | |
| 5 | 39962 | |
| M | 39907 | |
| o | 89 | < 0.1% |
| m | 57 | < 0.1% |
| t | 44 | < 0.1% |
| n | 44 | < 0.1% |
| Other values (9) | 307 | 0.1% |
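The Common Values table lists `AM 1.5` (39906) and `Am 1.5` (56) as separate categories, although the lowercased word counts (`am`: 39962) suggest they denote the same spectrum. A minimal sketch of merging them follows. It assumes the difference is only a capitalisation inconsistency; `canonical` is a hypothetical helper, and the counts are copied from the table.

```python
from collections import Counter

# Category counts copied from the Common Values table.
counts = {"AM 1.5": 39906, "Am 1.5": 56, "Indoor light": 43, "Monochromatic": 1}


def canonical(label):
    """Map case variants of 'AM 1.5' onto one spelling; leave other labels as-is."""
    cleaned = label.strip()
    return "AM 1.5" if cleaned.lower() == "am 1.5" else cleaned


merged = Counter()
for label, n in counts.items():
    merged[canonical(label)] += n

print(merged)
```

After merging, the column has three distinct categories instead of four.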
JV_scan_speed
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED 
| Distinct | 175 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 26362 |
| Missing (%) | 62.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 670.62527 |
| Minimum | 0.01 |
|---|---|
| Maximum | 5000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 332.1 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 33 |
| median | 100 |
| Q3 | 150 |
| 95-th percentile | 607.5 |
| Maximum | 5000000 |
| Range | 5000000 |
| Interquartile range (IQR) | 117 |
Descriptive statistics
| Standard deviation | 39884.645 |
|---|---|
| Coefficient of variation (CV) | 59.473818 |
| Kurtosis | 15305.475 |
| Mean | 670.62527 |
| Median Absolute Deviation (MAD) | 60 |
| Skewness | 122.33927 |
| Sum | 10820539 |
| Variance | 1.5907849 × 10⁹ |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 100 | 4288 | 10.1% |
| 50 | 2190 | 5.2% |
| 10 | 1990 | 4.7% |
| 200 | 983 | 2.3% |
| 20 | 963 | 2.3% |
| 150 | 509 | 1.2% |
| 30 | 487 | 1.1% |
| 300 | 471 | 1.1% |
| 1000 | 407 | 1.0% |
| 500 | 383 | 0.9% |
| Other values (165) | 3464 | 8.2% |
| (Missing) | 26362 | 62.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0.01 | 2 | < 0.1% |
| 0.02 | 5 | < 0.1% |
| 0.1 | 29 | 0.1% |
| 0.2 | 3 | < 0.1% |
| 0.25 | 1 | < 0.1% |
| 0.28 | 2 | < 0.1% |
| 0.3 | 4 | < 0.1% |
| 0.4 | 8 | < 0.1% |
| 0.6 | 4 | < 0.1% |
| 1 | 17 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 5000000 | 1 | < 0.1% |
| 500000 | 2 | < 0.1% |
| 165000 | 3 | < 0.1% |
| 100000 | 7 | < 0.1% |
| 50000 | 1 | < 0.1% |
| 30000 | 5 | < 0.1% |
| 20000 | 22 | 0.1% |
| 18800 | 1 | < 0.1% |
| 10000 | 26 | 0.1% |
| 7300 | 2 | < 0.1% |
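JV_scan_speed is flagged SKEWED: the mean (670.6) is far above the median (100), and the values run from 0.01 to 5000000. One common way to put the extremes in context, offered here as an illustration rather than as something the report computes, is Tukey's 1.5 × IQR fences together with the number of decades the range spans. The quartiles and extremes below are copied from the quantile table.

```python
import math

# Quantile statistics copied from the JV_scan_speed tables.
q1, q3 = 33.0, 150.0
minimum, maximum = 0.01, 5_000_000.0

iqr = q3 - q1                      # matches the reported IQR of 117
upper_fence = q3 + 1.5 * iqr       # Tukey upper fence
lower_fence = q1 - 1.5 * iqr       # negative, so no low-side outliers by this rule
decades = math.log10(maximum / minimum)  # orders of magnitude covered by the range

print(f"IQR = {iqr}, fences = ({lower_fence}, {upper_fence})")
print(f"range spans about {decades:.1f} decades")
```

Since the 95th percentile (607.5) already lies above the upper fence, more than 5% of the non-missing values would count as outliers under this rule, which argues for a log scale or rank-based statistics when analysing the column.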
JV_scan_delay_time
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 10 |
|---|---|
| Distinct (%) | 7.9% |
| Missing | 42370 |
| Missing (%) | 99.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 192.71496 |
| Minimum | 0.3 |
|---|---|
| Maximum | 1000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 332.1 KiB |
Quantile statistics
| Minimum | 0.3 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 50 |
| median | 100 |
| Q3 | 200 |
| 95-th percentile | 500 |
| Maximum | 1000 |
| Range | 999.7 |
| Interquartile range (IQR) | 150 |
Descriptive statistics
| Standard deviation | 206.77884 |
|---|---|
| Coefficient of variation (CV) | 1.0729776 |
| Kurtosis | 3.7770974 |
| Mean | 192.71496 |
| Median Absolute Deviation (MAD) | 99 |
| Skewness | 1.8306026 |
| Sum | 24474.8 |
| Variance | 42757.487 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 200 | 30 | 0.1% |
| 100 | 28 | 0.1% |
| 500 | 22 | 0.1% |
| 50 | 21 | < 0.1% |
| 10 | 9 | < 0.1% |
| 0.3 | 6 | < 0.1% |
| 1 | 3 | < 0.1% |
| 1000 | 3 | < 0.1% |
| 150 | 3 | < 0.1% |
| 40 | 2 | < 0.1% |
| (Missing) | 42370 | 99.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0.3 | 6 | < 0.1% |
| 1 | 3 | < 0.1% |
| 10 | 9 | < 0.1% |
| 40 | 2 | < 0.1% |
| 50 | 21 | < 0.1% |
| 100 | 28 | 0.1% |
| 150 | 3 | < 0.1% |
| 200 | 30 | 0.1% |
| 500 | 22 | 0.1% |
| 1000 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 1000 | 3 | < 0.1% |
| 500 | 22 | 0.1% |
| 200 | 30 | 0.1% |
| 150 | 3 | < 0.1% |
| 100 | 28 | 0.1% |
| 50 | 21 | < 0.1% |
| 40 | 2 | < 0.1% |
| 10 | 9 | < 0.1% |
| 1 | 3 | < 0.1% |
| 0.3 | 6 | < 0.1% |
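Because all ten distinct values of JV_scan_delay_time are itemised, its summary statistics can be rebuilt from the frequency table alone. The following is a minimal sketch using only the Python standard library. The variable names are illustrative, and the choice of sample (n - 1) standard deviation and linearly interpolated quartiles is an assumption that happens to reproduce the reported figures.

```python
import statistics

# Value -> count, copied from the JV_scan_delay_time frequency table.
freq = {200: 30, 100: 28, 500: 22, 50: 21, 10: 9,
        0.3: 6, 1: 3, 1000: 3, 150: 3, 40: 2}

# Expand the table into a sorted list of the non-missing observations.
values = sorted(v for v, n in freq.items() for _ in range(n))

mean = statistics.mean(values)
std = statistics.stdev(values)  # sample standard deviation (n - 1)
median = statistics.median(values)
q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
mad = statistics.median(abs(v - median) for v in values)

print("n, sum:", len(values), round(sum(values), 1))
print("mean, std, CV:", round(mean, 5), round(std, 5), round(std / mean, 7))
print("median, IQR, MAD:", median, q3 - q1, mad)
```

The printed values can be compared against the Mean, Standard deviation, Coefficient of variation, Interquartile range and Median Absolute Deviation rows reported for this variable.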
JV_scan_integration_time
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | 20.0% |
| Missing | 42477 |
| Missing (%) | > 99.9% |
| Memory size | 332.1 KiB |
| 30.0 | 7 |
|---|---|
| 20.0 | 6 |
| 16.67 | 4 |
| 3.0 | 3 |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.05 |
| Min length | 3 |
Characters and Unicode
| Total characters | 81 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 16.67 |
|---|---|
| 2nd row | 16.67 |
| 3rd row | 16.67 |
| 4th row | 16.67 |
| 5th row | 30.0 |
Common Values
| Value | Count | Frequency (%) |
| 30.0 | 7 | < 0.1% |
| 20.0 | 6 | < 0.1% |
| 16.67 | 4 | < 0.1% |
| 3.0 | 3 | < 0.1% |
| (Missing) | 42477 | > 99.9% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 30.0 | 7 | |
| 20.0 | 6 | |
| 16.67 | 4 | |
| 3.0 | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 29 | 35.8% |
| . | 20 | 24.7% |
| 3 | 10 | 12.3% |
| 6 | 8 | 9.9% |
| 2 | 6 | 7.4% |
| 1 | 4 | 4.9% |
| 7 | 4 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 61 | 75.3% |
| Other Punctuation | 20 | 24.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 29 | 47.5% |
| 3 | 10 | 16.4% |
| 6 | 8 | 13.1% |
| 2 | 6 | 9.8% |
| 1 | 4 | 6.6% |
| 7 | 4 | 6.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 20 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 81 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 29 | 35.8% |
| . | 20 | 24.7% |
| 3 | 10 | 12.3% |
| 6 | 8 | 9.9% |
| 2 | 6 | 7.4% |
| 1 | 4 | 4.9% |
| 7 | 4 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 81 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 29 | 35.8% |
| . | 20 | 24.7% |
| 3 | 10 | 12.3% |
| 6 | 8 | 9.9% |
| 2 | 6 | 7.4% |
| 1 | 4 | 4.9% |
| 7 | 4 | 4.9% |
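The character-level tables above are a by-product of JV_scan_integration_time being stored as text. A minimal sketch, assuming only the four categories and counts listed under Common Values, reproduces the character totals and shows that every category converts cleanly to a number. The dictionary and variable names are illustrative.

```python
from collections import Counter

# Category -> count, copied from the Common Values table.
categories = {"30.0": 7, "20.0": 6, "16.67": 4, "3.0": 3}

# Count every character of every non-missing string.
chars = Counter()
for text, n in categories.items():
    for ch in text:
        chars[ch] += n

total_chars = sum(chars.values())
n_rows = sum(categories.values())
print("total characters:", total_chars)
print("mean length:", total_chars / n_rows)
print("per character:", chars.most_common())

# All four categories parse as floats, so a numeric conversion loses nothing here.
as_numbers = {float(text): n for text, n in categories.items()}
print("numeric values:", sorted(as_numbers))
```

Treating the column as numeric would replace the character statistics with the quantile and descriptive statistics shown for the other JV timing variables.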
JV_preconditioning_protocol
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 14 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 41497 |
| Missing (%) | 97.6% |
| Memory size | 332.1 KiB |
| Light soaking | 479 |
|---|---|
| Potential biasing | 417 |
| Light soaking; Potential biasing | 26 |
| Heating | 19 |
| Cooling | 18 |
| Other values (9) | 41 |
Length
| Max length | 32 |
|---|---|
| Median length | 31 |
| Mean length | 15.066 |
| Min length | 4 |
Characters and Unicode
| Total characters | 15066 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Light soaking |
|---|---|
| 2nd row | Light soaking |
| 3rd row | Light soaking |
| 4th row | Light soaking |
| 5th row | Light soaking |
Common Values
| Value | Count | Frequency (%) |
| Light soaking | 479 | 1.1% |
| Potential biasing | 417 | 1.0% |
| Light soaking; Potential biasing | 26 | 0.1% |
| Heating | 19 | < 0.1% |
| Cooling | 18 | < 0.1% |
| Voc stabilization | 12 | < 0.1% |
| MPPT | 9 | < 0.1% |
| Heating; Light soaking | 5 | < 0.1% |
| Light Soaking; Potential biasing | 4 | < 0.1% |
| Bending | 3 | < 0.1% |
| Other values (4) | 8 | < 0.1% |
| (Missing) | 41497 | 97.6% |
| Value | Count | Frequency (%) |
| light | 517 | |
| soaking | 517 | |
| potential | 449 | |
| biasing | 447 | |
| heating | 24 | 1.2% |
| cooling | 18 | 0.9% |
| voc | 12 | 0.6% |
| stabilization | 12 | 0.6% |
| mppt | 9 | 0.4% |
| bending | 3 | 0.1% |
| Other values (4) | 9 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 2462 | |
| g | 1528 | |
| n | 1490 | |
| t | 1467 | |
| a | 1463 | |
| o | 1031 | 6.8% |
| (space) | 1017 | 6.8% |
| s | 975 | 6.5% |
| k | 522 | 3.5% |
| L | 517 | 3.4% |
| Other values (23) | 2594 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12943 | |
| Uppercase Letter | 1069 | 7.1% |
| Space Separator | 1017 | 6.8% |
| Other Punctuation | 37 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2462 | |
| g | 1528 | |
| n | 1490 | |
| t | 1467 | |
| a | 1463 | |
| o | 1031 | |
| s | 975 | 7.5% |
| k | 522 | 4.0% |
| h | 517 | 4.0% |
| e | 490 | 3.8% |
| Other values (10) | 998 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 517 | |
| P | 467 | |
| H | 24 | 2.2% |
| C | 18 | 1.7% |
| V | 12 | 1.1% |
| M | 9 | 0.8% |
| T | 9 | 0.8% |
| S | 5 | 0.5% |
| B | 3 | 0.3% |
| U | 3 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1017 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 37 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14012 | |
| Common | 1054 | 7.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 2462 | |
| g | 1528 | |
| n | 1490 | |
| t | 1467 | |
| a | 1463 | |
| o | 1031 | |
| s | 975 | 7.0% |
| k | 522 | 3.7% |
| L | 517 | 3.7% |
| h | 517 | 3.7% |
| Other values (21) | 2040 |
Common
| Value | Count | Frequency (%) |
| (space) | 1017 | 96.5% |
| ; | 37 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15066 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 2462 | |
| g | 1528 | |
| n | 1490 | |
| t | 1467 | |
| a | 1463 | |
| o | 1031 | 6.8% |
| (space) | 1017 | 6.8% |
| s | 975 | 6.5% |
| k | 522 | 3.5% |
| L | 517 | 3.4% |
| Other values (23) | 2594 |
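The Common Values table for JV_preconditioning_protocol lists "Light Soaking; Potential biasing" and "Light soaking; Potential biasing" as separate categories that differ only in capitalisation, and several entries combine steps with a semicolon. A minimal sketch of one way to merge such labels follows. The `normalise` rule (split on `;`, trim, lowercase) is a hypothetical cleaning choice, not something the report prescribes, and only the ten itemised categories are included because the remaining "Other values (4)" are not listed.

```python
from collections import Counter

# Category -> count, copied from the ten categories itemised in Common Values.
protocols = {
    "Light soaking": 479,
    "Potential biasing": 417,
    "Light soaking; Potential biasing": 26,
    "Heating": 19,
    "Cooling": 18,
    "Voc stabilization": 12,
    "MPPT": 9,
    "Heating; Light soaking": 5,
    "Light Soaking; Potential biasing": 4,
    "Bending": 3,
}

def normalise(label):
    """Split on ';', trim whitespace and lowercase each step (hypothetical rule)."""
    return tuple(step.strip().lower() for step in label.split(";"))

# Merge categories that become identical after normalisation.
merged = Counter()
for label, n in protocols.items():
    merged[normalise(label)] += n

# Count how often each individual step occurs across all protocols.
steps = Counter()
for key, n in merged.items():
    for step in key:
        steps[step] += n

print(merged.most_common())
print(steps.most_common())
```

Under this rule the two spellings of the combined light-soaking and biasing protocol collapse into a single category, and the per-step totals for heating and potential biasing match the counts in the word table above.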
JV_preconditioning_time
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 21 |
|---|---|
| Distinct (%) | 9.6% |
| Missing | 42279 |
| Missing (%) | 99.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 596.3211 |
| Minimum | 2 |
|---|---|
| Maximum | 6000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 332.1 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 10 |
| median | 60 |
| Q3 | 600 |
| 95-th percentile | 1800 |
| Maximum | 6000 |
| Range | 5998 |
| Interquartile range (IQR) | 590 |
Descriptive statistics
| Standard deviation | 1199.0264 |
|---|---|
| Coefficient of variation (CV) | 2.010706 |
| Kurtosis | 10.447455 |
| Mean | 596.3211 |
| Median Absolute Deviation (MAD) | 58 |
| Skewness | 3.1682549 |
| Sum | 129998 |
| Variance | 1437664.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 33 | 0.1% |
| 600 | 25 | 0.1% |
| 1800 | 22 | 0.1% |
| 2 | 22 | 0.1% |
| 60 | 21 | < 0.1% |
| 5 | 14 | < 0.1% |
| 30 | 13 | < 0.1% |
| 900 | 13 | < 0.1% |
| 300 | 10 | < 0.1% |
| 3 | 8 | < 0.1% |
| Other values (11) | 37 | 0.1% |
| (Missing) | 42279 | 99.5% |
| Value | Count | Frequency (%) |
| 2 | 22 | 0.1% |
| 3 | 8 | < 0.1% |
| 5 | 14 | < 0.1% |
| 10 | 33 | 0.1% |
| 20 | 3 | < 0.1% |
| 30 | 13 | < 0.1% |
| 40 | 5 | < 0.1% |
| 45 | 8 | < 0.1% |
| 60 | 21 | < 0.1% |
| 120 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 6000 | 5 | < 0.1% |
| 4800 | 5 | < 0.1% |
| 1800 | 22 | 0.1% |
| 1200 | 1 | < 0.1% |
| 900 | 13 | < 0.1% |
| 600 | 25 | 0.1% |
| 500 | 3 | < 0.1% |
| 360 | 1 | < 0.1% |
| 300 | 10 | < 0.1% |
| 240 | 1 | < 0.1% |
JV_preconditioning_potential
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 19 |
|---|---|
| Distinct (%) | 14.1% |
| Missing | 42362 |
| Missing (%) | 99.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.85274074 |
| Minimum | -1.5 |
|---|---|
| Maximum | 3 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Negative | 31 |
| Negative (%) | 0.1% |
| Memory size | 332.1 KiB |
Quantile statistics
| Minimum | -1.5 |
|---|---|
| 5-th percentile | -0.9 |
| Q1 | 0 |
| median | 1.2 |
| Q3 | 1.4 |
| 95-th percentile | 2 |
| Maximum | 3 |
| Range | 4.5 |
| Interquartile range (IQR) | 1.4 |
Descriptive statistics
| Standard deviation | 0.91148512 |
|---|---|
| Coefficient of variation (CV) | 1.0688889 |
| Kurtosis | -0.021268119 |
| Mean | 0.85274074 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | -0.91484955 |
| Sum | 115.12 |
| Variance | 0.83080512 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.2 | 41 | 0.1% |
| 1.4 | 28 | 0.1% |
| 2 | 11 | < 0.1% |
| -0.5 | 7 | < 0.1% |
| -0.1 | 6 | < 0.1% |
| 0.97 | 6 | < 0.1% |
| 1.5 | 6 | < 0.1% |
| 0 | 4 | < 0.1% |
| -0.3 | 4 | < 0.1% |
| -0.7 | 4 | < 0.1% |
| Other values (9) | 18 | < 0.1% |
| (Missing) | 42362 | 99.7% |
| Value | Count | Frequency (%) |
| -1.5 | 2 | < 0.1% |
| -1.2 | 3 | < 0.1% |
| -1 | 1 | < 0.1% |
| -0.9 | 4 | < 0.1% |
| -0.7 | 4 | < 0.1% |
| -0.5 | 7 | < 0.1% |
| -0.3 | 4 | < 0.1% |
| -0.1 | 6 | < 0.1% |
| 0 | 4 | < 0.1% |
| 0.5 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 3 | 1 | < 0.1% |
| 2 | 11 | < 0.1% |
| 1.5 | 6 | < 0.1% |
| 1.4 | 28 | 0.1% |
| 1.3 | 2 | < 0.1% |
| 1.2 | 41 | 0.1% |
| 1 | 2 | < 0.1% |
| 0.97 | 6 | < 0.1% |
| 0.6 | 1 | < 0.1% |
| 0.5 | 2 | < 0.1% |
JV_default_PCE
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 2318 |
|---|---|
| Distinct (%) | 5.6% |
| Missing | 925 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.021482 |
| Minimum | 0 |
|---|---|
| Maximum | 36.2 |
| Zeros | 74 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 332.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8.45 |
| median | 12.72 |
| Q3 | 16.1 |
| 95-th percentile | 19.35 |
| Maximum | 36.2 |
| Range | 36.2 |
| Interquartile range (IQR) | 7.65 |
Descriptive statistics
| Standard deviation | 5.2368397 |
|---|---|
| Coefficient of variation (CV) | 0.43562346 |
| Kurtosis | -0.50177351 |
| Mean | 12.021482 |
| Median Absolute Deviation (MAD) | 3.72 |
| Skewness | -0.42513749 |
| Sum | 499757.06 |
| Variance | 27.42449 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14 | 186 | 0.4% |
| 13 | 181 | 0.4% |
| 15 | 177 | 0.4% |
| 15.2 | 171 | 0.4% |
| 12 | 167 | 0.4% |
| 14.2 | 163 | 0.4% |
| 14.5 | 153 | 0.4% |
| 13.3 | 152 | 0.4% |
| 16 | 151 | 0.4% |
| 17 | 149 | 0.4% |
| Other values (2308) | 39922 | 93.9% |
| (Missing) | 925 | 2.2% |
| Value | Count | Frequency (%) |
| 0 | 74 | 0.2% |
| 0.000225144 | 1 | < 0.1% |
| 0.00042 | 1 | < 0.1% |
| 0.0007 | 1 | < 0.1% |
| 0.00083 | 1 | < 0.1% |
| 0.001 | 1 | < 0.1% |
| 0.0013804 | 1 | < 0.1% |
| 0.002 | 1 | < 0.1% |
| 0.003 | 1 | < 0.1% |
| 0.004 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 36.2 | 1 | < 0.1% |
| 35.9 | 1 | < 0.1% |
| 35.2 | 3 | < 0.1% |
| 34.8 | 1 | < 0.1% |
| 33.4 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 32.3 | 1 | < 0.1% |
| 31.4 | 1 | < 0.1% |
| 30.6 | 1 | < 0.1% |
| 30.3 | 1 | < 0.1% |
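Several of the reported figures for the numeric JV variables are linked by simple identities: range = maximum - minimum, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared, and sum = (observations - missing) × mean. A minimal sketch that checks these identities against the values printed above is given below. The dictionary keys and the `check` helper are illustrative names, and the relative tolerance is an assumption chosen to absorb the rounding of the printed values.

```python
import math

TOTAL_ROWS = 42497  # number of observations in the dataset

# Figures copied from the four numeric JV sections of the report.
reported = {
    "JV_scan_delay_time": dict(
        missing=42370, mean=192.71496, std=206.77884, var=42757.487,
        cv=1.0729776, total=24474.8, lo=0.3, hi=1000,
        q1=50, q3=200, rng=999.7, iqr=150),
    "JV_preconditioning_time": dict(
        missing=42279, mean=596.3211, std=1199.0264, var=1437664.3,
        cv=2.010706, total=129998, lo=2, hi=6000,
        q1=10, q3=600, rng=5998, iqr=590),
    "JV_preconditioning_potential": dict(
        missing=42362, mean=0.85274074, std=0.91148512, var=0.83080512,
        cv=1.0688889, total=115.12, lo=-1.5, hi=3,
        q1=0, q3=1.4, rng=4.5, iqr=1.4),
    "JV_default_PCE": dict(
        missing=925, mean=12.021482, std=5.2368397, var=27.42449,
        cv=0.43562346, total=499757.06, lo=0, hi=36.2,
        q1=8.45, q3=16.1, rng=36.2, iqr=7.65),
}

def check(name, s):
    """Return the identities that the reported figures fail to satisfy."""
    n = TOTAL_ROWS - s["missing"]  # non-missing observations
    tests = {
        "range = max - min": (s["hi"] - s["lo"], s["rng"]),
        "IQR = Q3 - Q1": (s["q3"] - s["q1"], s["iqr"]),
        "CV = std / mean": (s["std"] / s["mean"], s["cv"]),
        "variance = std^2": (s["std"] ** 2, s["var"]),
        "sum = n * mean": (n * s["mean"], s["total"]),
    }
    return [f"{name}: {label}" for label, (got, want) in tests.items()
            if not math.isclose(got, want, rel_tol=1e-5)]

problems = [p for name, s in reported.items() for p in check(name, s)]
print(problems or "all identities hold")
```

A check of this kind is a quick way to catch transcription errors when figures from the report are copied into other documents.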
| | Cell_area_measured | Module_area_total | Perovskite_deposition_number_of_deposition_steps | JV_light_intensity | JV_scan_speed | JV_scan_delay_time | JV_preconditioning_time | JV_preconditioning_potential | JV_default_PCE | Cell_architecture | Cell_flexible | Module | ETL_surface_treatment_before_next_deposition_step | Perovskite_dimension_2D | Perovskite_dimension_3D | Perovskite_dimension_3D_with_2D_capping_layer | Perovskite_composition_perovskite_ABC3_structure | Perovskite_composition_b_ions | Perovskite_composition_c_ions | Perovskite_composition_none_stoichiometry_components_in_excess | Perovskite_band_gap_graded | Perovskite_deposition_quenching_induced_crystallisation | Perovskite_deposition_quenching_media | Perovskite_deposition_quenching_media_additives_compounds | Perovskite_deposition_thermal_annealing_atmosphere | Perovskite_deposition_solvent_annealing | Perovskite_deposition_solvent_annealing_solvent_atmosphere | Perovskite_surface_treatment_before_next_deposition_step | HTL_deposition_synthesis_atmosphere | HTL_deposition_solvents | HTL_deposition_solvents_mixing_ratios | Add_lay_front_stack_sequence | Add_lay_back | Encapsulation | JV_light_spectra | JV_scan_integration_time | JV_preconditioning_protocol |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cell_area_measured | 1.000 | 0.749 | -0.012 | -0.053 | -0.139 | 0.519 | 0.045 | -0.170 | 0.017 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 |
| Module_area_total | 0.749 | 1.000 | 0.134 | NaN | 0.033 | NaN | 0.893 | NaN | 0.335 | 0.000 | 0.202 | 0.121 | 1.000 | 1.000 | 0.292 | 0.292 | 0.000 | 0.000 | 0.082 | 0.000 | 1.000 | 0.222 | 0.171 | 1.000 | 0.300 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.150 | 0.000 | 0.341 | 0.000 | 0.000 | 0.987 |
| Perovskite_deposition_number_of_deposition_steps | -0.012 | 0.134 | 1.000 | -0.017 | 0.022 | 0.263 | -0.015 | 0.476 | -0.089 | 0.058 | 0.011 | 0.041 | 0.286 | 0.085 | 0.077 | 0.087 | 0.000 | 0.494 | 0.366 | 0.243 | 0.300 | 0.500 | 0.185 | 0.000 | 0.296 | 0.011 | 0.017 | 0.000 | 0.059 | 0.022 | 0.000 | 0.000 | 0.012 | 0.025 | 0.031 | 0.801 | 0.145 |
| JV_light_intensity | -0.053 | NaN | -0.017 | 1.000 | 0.004 | NaN | NaN | NaN | -0.008 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.010 | 0.000 | 1.000 | 0.030 | 0.012 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 |
| JV_scan_speed | -0.139 | 0.033 | 0.022 | 0.004 | 1.000 | -0.309 | -0.191 | 0.092 | -0.059 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.044 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 |
| JV_scan_delay_time | 0.519 | NaN | 0.263 | NaN | -0.309 | 1.000 | NaN | NaN | -0.298 | 0.170 | 0.363 | 1.000 | 1.000 | 0.551 | 0.551 | 1.000 | 1.000 | 0.171 | 0.221 | 0.558 | 1.000 | 0.265 | 0.246 | 1.000 | 0.377 | 0.104 | 1.000 | 0.000 | 0.310 | 0.424 | 0.975 | 1.000 | 1.000 | 0.117 | 0.000 | 0.000 | 0.000 |
| JV_preconditioning_time | 0.045 | 0.893 | -0.015 | NaN | -0.191 | NaN | 1.000 | 0.030 | -0.258 | 0.299 | 1.000 | 0.084 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.505 | 0.473 | 1.000 | 1.000 | 0.569 | 0.657 | 1.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 1.000 | 0.893 | 1.000 | 0.000 | 0.705 |
| JV_preconditioning_potential | -0.170 | NaN | 0.476 | NaN | 0.092 | NaN | 0.030 | 1.000 | 0.147 | 0.493 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.157 | 0.166 | 0.472 | 1.000 | 0.396 | 0.045 | 0.000 | 0.402 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.000 | 0.287 |
| JV_default_PCE | 0.017 | 0.335 | -0.089 | -0.008 | -0.059 | -0.298 | -0.258 | 0.147 | 1.000 | 0.041 | 0.045 | 0.040 | 0.298 | 0.121 | 0.127 | 0.129 | 0.172 | 0.144 | 0.134 | 0.191 | 0.006 | 0.348 | 0.129 | 0.335 | 0.034 | 0.047 | 0.018 | 1.000 | 0.023 | 0.031 | 0.194 | 0.038 | 0.009 | 0.039 | 0.395 | 0.322 | 0.435 |
| Cell_architecture | 0.000 | 0.000 | 0.058 | 0.000 | 0.000 | 0.170 | 0.299 | 0.493 | 0.041 | 1.000 | 0.094 | 0.039 | 0.317 | 0.062 | 0.043 | 0.000 | 0.021 | 0.133 | 0.066 | 0.136 | 0.009 | 0.062 | 0.083 | 0.241 | 0.013 | 0.112 | 0.108 | 0.000 | 0.035 | 0.063 | 0.409 | 0.000 | 0.000 | 0.072 | 0.008 | 0.841 | 0.147 |
| Cell_flexible | 0.000 | 0.202 | 0.011 | 0.000 | 0.000 | 0.363 | 1.000 | 1.000 | 0.045 | 0.094 | 1.000 | 0.026 | 0.654 | 0.017 | 0.022 | 0.008 | 0.008 | 0.043 | 0.009 | 0.124 | 0.000 | 0.022 | 0.119 | 0.353 | 0.003 | 0.013 | 0.000 | 1.000 | 0.051 | 0.077 | 0.262 | 0.072 | 0.000 | 0.042 | 0.000 | 1.000 | 0.363 |
| Module | 0.000 | 0.121 | 0.041 | 0.000 | 0.000 | 1.000 | 0.084 | 1.000 | 0.040 | 0.039 | 0.026 | 1.000 | 0.317 | 0.012 | 0.010 | 0.000 | 0.006 | 0.000 | 0.000 | 0.071 | 0.000 | 0.026 | 0.058 | 0.000 | 0.062 | 0.012 | 0.070 | 1.000 | 0.027 | 0.016 | 0.148 | 0.000 | 0.023 | 0.057 | 0.000 | 1.000 | 0.183 |
| ETL_surface_treatment_before_next_deposition_step | 1.000 | 1.000 | 0.286 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.298 | 0.317 | 0.654 | 0.317 | 1.000 | 1.000 | 0.123 | 0.000 | 1.000 | 0.000 | 0.241 | 0.470 | 1.000 | 0.550 | 0.479 | 0.975 | 0.588 | 0.000 | 1.000 | 0.000 | 0.685 | 0.398 | 0.967 | 1.000 | 1.000 | 0.561 | 1.000 | 1.000 | 0.000 |
| Perovskite_dimension_2D | 0.000 | 1.000 | 0.085 | 0.000 | 0.000 | 0.551 | 1.000 | 1.000 | 0.121 | 0.062 | 0.017 | 0.012 | 1.000 | 1.000 | 0.854 | 0.008 | 0.113 | 0.121 | 0.083 | 0.292 | 0.016 | 0.067 | 0.062 | 0.609 | 0.032 | 0.001 | 0.000 | 1.000 | 0.027 | 0.000 | 0.000 | 0.000 | 0.000 | 0.021 | 0.000 | 1.000 | 1.000 |
| Perovskite_dimension_3D | 0.000 | 0.292 | 0.077 | 0.000 | 0.000 | 0.551 | 1.000 | 1.000 | 0.127 | 0.043 | 0.022 | 0.010 | 0.123 | 0.854 | 1.000 | 0.025 | 0.123 | 0.119 | 0.081 | 0.355 | 0.000 | 0.067 | 0.089 | 0.201 | 0.077 | 0.010 | 0.000 | 1.000 | 0.060 | 0.039 | 0.000 | 0.000 | 0.000 | 0.009 | 0.002 | 1.000 | 0.000 |
| Perovskite_dimension_3D_with_2D_capping_layer | 0.000 | 0.292 | 0.087 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 | 0.129 | 0.000 | 0.008 | 0.000 | 0.000 | 0.008 | 0.025 | 1.000 | 0.006 | 0.619 | 0.676 | 0.145 | 0.000 | 0.035 | 0.126 | 0.671 | 0.008 | 0.007 | 0.000 | 1.000 | 0.014 | 0.006 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 |
| Perovskite_composition_perovskite_ABC3_structure | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 | 0.172 | 0.021 | 0.008 | 0.006 | 1.000 | 0.113 | 0.123 | 0.006 | 1.000 | 0.553 | 0.151 | 0.282 | 0.000 | 0.065 | 0.105 | 0.506 | 0.000 | 0.000 | 0.000 | 1.000 | 0.037 | 0.066 | 0.000 | 0.000 | 0.000 | 0.011 | 0.000 | 1.000 | 0.000 |
| Perovskite_composition_b_ions | 0.000 | 0.000 | 0.494 | 0.000 | 0.000 | 0.171 | 0.505 | 0.157 | 0.144 | 0.133 | 0.043 | 0.000 | 0.000 | 0.121 | 0.119 | 0.619 | 0.553 | 1.000 | 0.439 | 0.426 | 0.395 | 0.108 | 0.019 | 0.536 | 0.058 | 0.000 | 0.000 | 1.000 | 0.146 | 0.108 | 0.559 | 0.143 | 0.000 | 0.123 | 0.000 | 0.943 | 0.000 |
| Perovskite_composition_c_ions | 0.000 | 0.082 | 0.366 | 0.000 | 0.000 | 0.221 | 0.473 | 0.166 | 0.134 | 0.066 | 0.009 | 0.000 | 0.241 | 0.083 | 0.081 | 0.676 | 0.151 | 0.439 | 1.000 | 0.378 | 0.377 | 0.284 | 0.076 | 0.602 | 0.167 | 0.006 | 0.000 | 0.000 | 0.089 | 0.035 | 0.405 | 0.034 | 0.000 | 0.071 | 0.000 | 0.000 | 0.081 |
| Perovskite_composition_none_stoichiometry_components_in_excess | 1.000 | 0.000 | 0.243 | 0.000 | 1.000 | 0.558 | 1.000 | 0.472 | 0.191 | 0.136 | 0.124 | 0.071 | 0.470 | 0.292 | 0.355 | 0.145 | 0.282 | 0.426 | 0.378 | 1.000 | 0.042 | 0.579 | 0.127 | 0.392 | 0.149 | 0.176 | 0.000 | 1.000 | 0.146 | 0.094 | 0.159 | 0.057 | 0.344 | 0.137 | 0.291 | 0.000 | 0.422 |
| Perovskite_band_gap_graded | 0.000 | 1.000 | 0.300 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 | 0.006 | 0.009 | 0.000 | 0.000 | 1.000 | 0.016 | 0.000 | 0.000 | 0.000 | 0.395 | 0.377 | 0.042 | 1.000 | 0.006 | 0.000 | 1.000 | 0.363 | 0.000 | 0.000 | 1.000 | 0.170 | 0.139 | 0.000 | 0.000 | 0.000 | 0.004 | 0.000 | 0.943 | 1.000 |
| Perovskite_deposition_quenching_induced_crystallisation | 0.000 | 0.222 | 0.500 | 0.010 | 0.000 | 0.265 | 0.569 | 0.396 | 0.348 | 0.062 | 0.022 | 0.026 | 0.550 | 0.067 | 0.067 | 0.035 | 0.065 | 0.108 | 0.284 | 0.579 | 0.006 | 1.000 | 0.991 | 0.252 | 0.090 | 0.010 | 0.065 | 0.000 | 0.067 | 0.076 | 0.232 | 0.045 | 0.014 | 0.011 | 0.029 | 0.943 | 0.230 |
| Perovskite_deposition_quenching_media | 0.000 | 0.171 | 0.185 | 0.000 | 0.044 | 0.246 | 0.657 | 0.045 | 0.129 | 0.083 | 0.119 | 0.058 | 0.479 | 0.062 | 0.089 | 0.126 | 0.105 | 0.019 | 0.076 | 0.127 | 0.000 | 0.991 | 1.000 | 0.501 | 0.061 | 0.295 | 0.097 | 0.707 | 0.032 | 0.182 | 0.418 | 0.000 | 0.000 | 0.084 | 0.085 | 0.777 | 0.305 |
| Perovskite_deposition_quenching_media_additives_compounds | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 0.335 | 0.241 | 0.353 | 0.000 | 0.975 | 0.609 | 0.201 | 0.671 | 0.506 | 0.536 | 0.602 | 0.392 | 1.000 | 0.252 | 0.501 | 1.000 | 0.378 | 0.000 | 0.000 | 0.000 | 0.229 | 0.199 | 0.370 | 0.379 | 1.000 | 0.516 | 0.917 | 0.000 | 1.000 |
| Perovskite_deposition_thermal_annealing_atmosphere | 0.000 | 0.300 | 0.296 | 0.030 | 0.000 | 0.377 | 0.000 | 0.402 | 0.034 | 0.013 | 0.003 | 0.062 | 0.588 | 0.032 | 0.077 | 0.008 | 0.000 | 0.058 | 0.167 | 0.149 | 0.363 | 0.090 | 0.061 | 0.378 | 1.000 | 0.116 | 0.032 | 0.000 | 0.269 | 0.178 | 0.190 | 0.067 | 0.000 | 0.062 | 0.103 | 0.970 | 0.035 |
| Perovskite_deposition_solvent_annealing | 0.000 | 0.000 | 0.011 | 0.012 | 0.000 | 0.104 | 1.000 | 1.000 | 0.047 | 0.112 | 0.013 | 0.012 | 0.000 | 0.001 | 0.010 | 0.007 | 0.000 | 0.000 | 0.006 | 0.176 | 0.000 | 0.010 | 0.295 | 0.000 | 0.116 | 1.000 | 0.874 | 0.000 | 0.065 | 0.126 | 0.470 | 0.000 | 0.003 | 0.006 | 0.000 | 1.000 | 0.000 |
| Perovskite_deposition_solvent_annealing_solvent_atmosphere | 0.000 | 0.000 | 0.017 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 | 0.018 | 0.108 | 0.000 | 0.070 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.065 | 0.097 | 0.000 | 0.032 | 0.874 | 1.000 | 0.000 | 0.088 | 0.083 | 0.418 | 0.000 | 0.140 | 0.080 | 0.000 | 1.000 | 0.000 |
| Perovskite_surface_treatment_before_next_deposition_step | 1.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.707 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | NaN |
| HTL_deposition_synthesis_atmosphere | 0.000 | 0.000 | 0.059 | 0.000 | 0.000 | 0.310 | 0.000 | 0.000 | 0.023 | 0.035 | 0.051 | 0.027 | 0.685 | 0.027 | 0.060 | 0.014 | 0.037 | 0.146 | 0.089 | 0.146 | 0.170 | 0.067 | 0.032 | 0.229 | 0.269 | 0.065 | 0.088 | 0.000 | 1.000 | 0.684 | 0.485 | 0.046 | 0.000 | 0.051 | 0.000 | 1.000 | 0.163 |
| HTL_deposition_solvents | 0.000 | 0.000 | 0.022 | 0.000 | 0.000 | 0.424 | 0.000 | 0.000 | 0.031 | 0.063 | 0.077 | 0.016 | 0.398 | 0.000 | 0.039 | 0.006 | 0.066 | 0.108 | 0.035 | 0.094 | 0.139 | 0.076 | 0.182 | 0.199 | 0.178 | 0.126 | 0.083 | 1.000 | 0.684 | 1.000 | 0.936 | 0.066 | 0.000 | 0.058 | 0.083 | 1.000 | 0.568 |
| HTL_deposition_solvents_mixing_ratios | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 0.975 | 1.000 | 0.000 | 0.194 | 0.409 | 0.262 | 0.148 | 0.967 | 0.000 | 0.000 | 0.000 | 0.000 | 0.559 | 0.405 | 0.159 | 0.000 | 0.232 | 0.418 | 0.370 | 0.190 | 0.470 | 0.418 | 0.000 | 0.485 | 0.936 | 1.000 | 1.000 | 1.000 | 0.223 | 1.000 | 1.000 | 1.000 |
| Add_lay_front_stack_sequence | 0.000 | 0.150 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 1.000 | 0.038 | 0.000 | 0.072 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.143 | 0.034 | 0.057 | 0.000 | 0.045 | 0.000 | 0.379 | 0.067 | 0.000 | 0.000 | 1.000 | 0.046 | 0.066 | 1.000 | 1.000 | 0.599 | 0.086 | 0.000 | 1.000 | 0.535 |
| Add_lay_back | 0.000 | 0.000 | 0.012 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 | 0.009 | 0.000 | 0.000 | 0.023 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.344 | 0.000 | 0.014 | 0.000 | 1.000 | 0.000 | 0.003 | 0.140 | 1.000 | 0.000 | 0.000 | 1.000 | 0.599 | 1.000 | 0.038 | 0.000 | 1.000 | 1.000 |
| Encapsulation | 0.000 | 0.341 | 0.025 | 0.000 | 0.000 | 0.117 | 0.893 | 0.000 | 0.039 | 0.072 | 0.042 | 0.057 | 0.561 | 0.021 | 0.009 | 0.000 | 0.011 | 0.123 | 0.071 | 0.137 | 0.004 | 0.011 | 0.084 | 0.516 | 0.062 | 0.006 | 0.080 | 1.000 | 0.051 | 0.058 | 0.223 | 0.086 | 0.038 | 1.000 | 0.042 | 1.000 | 0.211 |
| JV_light_spectra | 0.000 | 0.000 | 0.031 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.395 | 0.008 | 0.000 | 0.000 | 1.000 | 0.000 | 0.002 | 0.000 | 0.000 | 0.000 | 0.000 | 0.291 | 0.000 | 0.029 | 0.085 | 0.917 | 0.103 | 0.000 | 0.000 | 1.000 | 0.000 | 0.083 | 1.000 | 0.000 | 0.000 | 0.042 | 1.000 | 1.000 | 1.000 |
| JV_scan_integration_time | 1.000 | 0.000 | 0.801 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.322 | 0.841 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.943 | 0.000 | 0.000 | 0.943 | 0.943 | 0.777 | 0.000 | 0.970 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 |
| JV_preconditioning_protocol | 1.000 | 0.987 | 0.145 | 0.000 | 1.000 | 0.000 | 0.705 | 0.287 | 0.435 | 0.147 | 0.363 | 0.183 | 0.000 | 1.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.081 | 0.422 | 1.000 | 0.230 | 0.305 | 1.000 | 0.035 | 0.000 | 0.000 | NaN | 0.163 | 0.568 | 1.000 | 0.535 | 1.000 | 0.211 | 1.000 | 0.000 | 1.000 |
| | Ref_publication_date | Cell_area_measured | Cell_architecture | Cell_flexible | Module | Module_area_total | Substrate_stack_sequence | ETL_stack_sequence | ETL_thickness | ETL_additives_compounds | ETL_deposition_procedure | ETL_surface_treatment_before_next_deposition_step | Perovskite_dimension_2D | Perovskite_dimension_3D | Perovskite_dimension_3D_with_2D_capping_layer | Perovskite_composition_perovskite_ABC3_structure | Perovskite_composition_a_ions | Perovskite_composition_a_ions_coefficients | Perovskite_composition_b_ions | Perovskite_composition_b_ions_coefficients | Perovskite_composition_c_ions | Perovskite_composition_c_ions_coefficients | Perovskite_composition_none_stoichiometry_components_in_excess | Perovskite_additives_compounds | Perovskite_additives_concentrations | Perovskite_thickness | Perovskite_band_gap | Perovskite_band_gap_graded | Perovskite_pl_max | Perovskite_deposition_number_of_deposition_steps | Perovskite_deposition_procedure | Perovskite_deposition_synthesis_atmosphere | Perovskite_deposition_solvents | Perovskite_deposition_solvents_mixing_ratios | Perovskite_deposition_quenching_induced_crystallisation | Perovskite_deposition_quenching_media | Perovskite_deposition_quenching_media_mixing_ratios | Perovskite_deposition_quenching_media_additives_compounds | Perovskite_deposition_thermal_annealing_temperature | Perovskite_deposition_thermal_annealing_time | Perovskite_deposition_thermal_annealing_atmosphere | Perovskite_deposition_solvent_annealing | Perovskite_deposition_solvent_annealing_solvent_atmosphere | Perovskite_deposition_after_treatment_of_formed_perovskite | Perovskite_surface_treatment_before_next_deposition_step | HTL_stack_sequence | HTL_thickness_list | HTL_additives_compounds | HTL_additives_concentrations | HTL_deposition_procedure | HTL_deposition_synthesis_atmosphere | HTL_deposition_solvents | HTL_deposition_solvents_mixing_ratios | Backcontact_stack_sequence | Backcontact_thickness_list | Backcontact_deposition_procedure | Add_lay_front_stack_sequence | Add_lay_back | Encapsulation | Encapsulation_stack_sequence | JV_light_intensity | JV_light_spectra | JV_scan_speed | JV_scan_delay_time | JV_scan_integration_time | JV_preconditioning_protocol | JV_preconditioning_time | JV_preconditioning_potential | JV_default_PCE |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2015-01-06 | 0.2 | nip | False | False | NaN | SLG \| FTO | TiO2-c \| TiO2-mp | 65.0 \| nan | Unknown | Spray-pyrolys \| Spin-coating | NaN | False | True | False | True | Cs | 1 | Sn | 1 | I | 3 | NaN | NaN | NaN | NaN | 1.27 | False | NaN | 1 | Spin-coating | N2 | DMSO | 1 | False | Unknown | NaN | NaN | 100 | 10.0 | Unknown | False | Unknown | NaN | NaN | Spiro-MeOTAD | NaN | Li(CF3SO2)2N; TBP | NaN | Spin-coating | Unknown | Unknown | NaN | Au | 90.0 | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | NaN | NaN | NaN | NaN | NaN | NaN | 0.00 |
| 1 | 2015-01-06 | 0.2 | nip | False | False | NaN | SLG \| FTO | TiO2-c \| TiO2-mp | 65.0 \| nan | Unknown | Spray-pyrolys \| Spin-coating | NaN | False | True | False | True | Cs | 1 | Sn | 1 | Br; I | 0.3; 2.7 | NaN | NaN | NaN | NaN | NaN | False | NaN | 1 | Spin-coating | N2 | DMSO | 1 | False | Unknown | NaN | NaN | 100 | 10.0 | Unknown | False | Unknown | NaN | NaN | Spiro-MeOTAD | NaN | Li(CF3SO2)2N; TBP | NaN | Spin-coating | Unknown | Unknown | NaN | Au | 90.0 | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | NaN | NaN | NaN | NaN | NaN | NaN | 0.00 |
| 2 | 2015-01-06 | 0.2 | nip | False | False | NaN | SLG \| FTO | TiO2-c \| TiO2-mp | 65.0 \| nan | Unknown | Spray-pyrolys \| Spin-coating | NaN | False | True | False | True | Cs | 1 | Sn | 1 | Br; I | 1.5; 1.5 | NaN | NaN | NaN | NaN | NaN | False | NaN | 1 | Spin-coating | N2 | DMSO | 1 | False | Unknown | NaN | NaN | 100 | 10.0 | Unknown | False | Unknown | NaN | NaN | Spiro-MeOTAD | NaN | Li(CF3SO2)2N; TBP | NaN | Spin-coating | Unknown | Unknown | NaN | Au | 90.0 | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | NaN | NaN | NaN | NaN | NaN | NaN | 0.13 |
| 3 | 2015-01-06 | 0.2 | nip | False | False | NaN | SLG \| FTO | TiO2-c \| TiO2-mp | 65.0 \| nan | Unknown | Spray-pyrolys \| Spin-coating | NaN | False | True | False | True | Cs | 1 | Sn | 1 | Br; I | 2.7; 0.3 | NaN | NaN | NaN | NaN | NaN | False | NaN | 1 | Spin-coating | N2 | DMSO | 1 | False | Unknown | NaN | NaN | 100 | 10.0 | Unknown | False | Unknown | NaN | NaN | Spiro-MeOTAD | NaN | Li(CF3SO2)2N; TBP | NaN | Spin-coating | Unknown | Unknown | NaN | Au | 90.0 | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | NaN | NaN | NaN | NaN | NaN | NaN | 0.12 |
| 4 | 2015-01-06 | 0.2 | nip | False | False | NaN | SLG \| FTO | TiO2-c \| TiO2-mp | 65.0 \| nan | Unknown | Spray-pyrolys \| Spin-coating | NaN | False | True | False | True | Cs | 1 | Sn | 1 | Br | 3 | NaN | NaN | NaN | NaN | 1.75 | False | NaN | 1 | Spin-coating | N2 | DMSO | 1 | False | Unknown | NaN | NaN | 100 | 10.0 | Unknown | False | Unknown | NaN | NaN | Spiro-MeOTAD | NaN | Li(CF3SO2)2N; TBP | NaN | Spin-coating | Unknown | Unknown | NaN | Au | 90.0 | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | NaN | NaN | NaN | NaN | NaN | NaN | 0.10 |
| 5 | 2015-01-06 | 0.2 | nip | False | False | NaN | SLG \| FTO | TiO2-c \| TiO2-mp | 65.0 \| nan | Unknown | Spray-pyrolys \| Spin-coating | NaN | False | True | False | True | Cs | 1 | Sn | 1 | I | 3 | NaN | SnF2 | 0.2 | NaN | 1.27 | False | NaN | 1 | Spin-coating | N2 | DMSO | 1 | False | Unknown | NaN | NaN | 100 | 10.0 | Unknown | False | Unknown | NaN | NaN | Spiro-MeOTAD | NaN | Li(CF3SO2)2N; TBP | NaN | Spin-coating | Unknown | Unknown | NaN | Au | 90.0 | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | NaN | NaN | NaN | NaN | NaN | NaN | 1.66 |
| 6 | 2015-01-06 | 0.2 | nip | False | False | NaN | SLG \| FTO | TiO2-c \| TiO2-mp | 65.0 \| nan | Unknown | Spray-pyrolys \| Spin-coating | NaN | False | True | False | True | Cs | 1 | Sn | 1 | Br; I | 1; 2 | NaN | SnF2 | 0.2 | NaN | 1.37 | False | NaN | 1 | Spin-coating | N2 | DMSO | 1 | False | Unknown | NaN | NaN | 100 | 10.0 | Unknown | False | Unknown | NaN | NaN | Spiro-MeOTAD | NaN | Li(CF3SO2)2N; TBP | NaN | Spin-coating | Unknown | Unknown | NaN | Au | 90.0 | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | NaN | NaN | NaN | NaN | NaN | NaN | 1.67 |
| 7 | 2015-01-06 | 0.2 | nip | False | False | NaN | SLG \| FTO | TiO2-c \| TiO2-mp | 65.0 \| nan | Unknown | Spray-pyrolys \| Spin-coating | NaN | False | True | False | True | Cs | 1 | Sn | 1 | Br; I | 2; 1 | NaN | SnF2 | 0.2 | NaN | 1.65 | False | NaN | 1 | Spin-coating | N2 | DMSO | 1 | False | Unknown | NaN | NaN | 100 | 10.0 | Unknown | False | Unknown | NaN | NaN | Spiro-MeOTAD | NaN | Li(CF3SO2)2N; TBP | NaN | Spin-coating | Unknown | Unknown | NaN | Au | 90.0 | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | NaN | NaN | NaN | NaN | NaN | NaN | 1.56 |
| 8 | 2015-01-06 | 0.2 | nip | False | False | NaN | SLG \| FTO | TiO2-c \| TiO2-mp | 65.0 \| nan | Unknown | Spray-pyrolys \| Spin-coating | NaN | False | True | False | True | Cs | 1 | Sn | 1 | Br | 3 | NaN | SnF2 | 0.2 | NaN | 1.75 | False | NaN | 1 | Spin-coating | N2 | DMSO | 1 | False | Unknown | NaN | NaN | 100 | 10.0 | Unknown | False | Unknown | NaN | NaN | Spiro-MeOTAD | NaN | Li(CF3SO2)2N; TBP | NaN | Spin-coating | Unknown | Unknown | NaN | Au | 90.0 | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | NaN | NaN | NaN | NaN | NaN | NaN | 0.95 |
| 9 | 2017-06-06 | 0.1 | nip | False | False | NaN | SLG \| FTO | TiO2-c \| TiO2-nw | nan \| 400.0 | Unknown | CBD \| Hydrothermal | NaN | False | True | False | True | MA | 1 | Pb | 1 | I | 3 | NaN | Cl | 0.25 | NaN | 1.6 | False | 775.0 | 1 | Spin-coating | Ar | DMF | 1 | False | Unknown | NaN | NaN | 110 | 60.0 | Unknown | False | Unknown | NaN | NaN | Spiro-MeOTAD | NaN | Li-TFSI; TPB | NaN | Spin-coating | Unknown | Unknown | NaN | Au | 80.0 | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | NaN | NaN | NaN | NaN | NaN | NaN | 10.30 |
| | Ref_publication_date | Cell_area_measured | Cell_architecture | Cell_flexible | Module | Module_area_total | Substrate_stack_sequence | ETL_stack_sequence | ETL_thickness | ETL_additives_compounds | ETL_deposition_procedure | ETL_surface_treatment_before_next_deposition_step | Perovskite_dimension_2D | Perovskite_dimension_3D | Perovskite_dimension_3D_with_2D_capping_layer | Perovskite_composition_perovskite_ABC3_structure | Perovskite_composition_a_ions | Perovskite_composition_a_ions_coefficients | Perovskite_composition_b_ions | Perovskite_composition_b_ions_coefficients | Perovskite_composition_c_ions | Perovskite_composition_c_ions_coefficients | Perovskite_composition_none_stoichiometry_components_in_excess | Perovskite_additives_compounds | Perovskite_additives_concentrations | Perovskite_thickness | Perovskite_band_gap | Perovskite_band_gap_graded | Perovskite_pl_max | Perovskite_deposition_number_of_deposition_steps | Perovskite_deposition_procedure | Perovskite_deposition_synthesis_atmosphere | Perovskite_deposition_solvents | Perovskite_deposition_solvents_mixing_ratios | Perovskite_deposition_quenching_induced_crystallisation | Perovskite_deposition_quenching_media | Perovskite_deposition_quenching_media_mixing_ratios | Perovskite_deposition_quenching_media_additives_compounds | Perovskite_deposition_thermal_annealing_temperature | Perovskite_deposition_thermal_annealing_time | Perovskite_deposition_thermal_annealing_atmosphere | Perovskite_deposition_solvent_annealing | Perovskite_deposition_solvent_annealing_solvent_atmosphere | Perovskite_deposition_after_treatment_of_formed_perovskite | Perovskite_surface_treatment_before_next_deposition_step | HTL_stack_sequence | HTL_thickness_list | HTL_additives_compounds | HTL_additives_concentrations | HTL_deposition_procedure | HTL_deposition_synthesis_atmosphere | HTL_deposition_solvents | HTL_deposition_solvents_mixing_ratios | Backcontact_stack_sequence | Backcontact_thickness_list | Backcontact_deposition_procedure | Add_lay_front_stack_sequence | Add_lay_back | Encapsulation | Encapsulation_stack_sequence | JV_light_intensity | JV_light_spectra | JV_scan_speed | JV_scan_delay_time | JV_scan_integration_time | JV_preconditioning_protocol | JV_preconditioning_time | JV_preconditioning_potential | JV_default_PCE |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 42487 | 2019-07-12 | 0.101 | nip | False | False | 0.25 | SLG \| FTO | TiO2-c \| TiO2-mp | 50.0 \| 250.0 | TiCl4 | Spin-coating \| Spin-coating >> Hydrothermal | NaN | False | True | False | True | MA | 1.0 | Pb | 1 | Br | 3.0 | Stoichiometric | Undoped | NaN | 400.0 | 2.3 | False | 535.0 | 0 | Spin-coating | N2 | DMF; DMSO | 4; 1 | True | Chlorobenzene | NaN | Undoped | 100 | 60.0 | N2 | False | Unknown | NaN | NaN | Spiro-MeOTAD | 200.0 | Li-TFSI; TBP | 14.2 mM; 8 vol% | Spin-coating | N2 | Chlorobenzene | NaN | Au | 60.0 | Evaporation | MgF2 | False | False | Cover glass-QDs | 100.0 | AM 1.5 | 100.0 | NaN | NaN | NaN | NaN | NaN | 1.950000 |
| 42488 | 2019-07-12 | 0.113 | nip | False | False | 0.25 | SLG \| FTO | TiO2-c \| TiO2-mp | 50.0 \| 250.0 | TiCl4 | Spin-coating \| Spin-coating >> Hydrothermal | NaN | False | True | False | True | MA | 1.0 | Ag; Pb | 0.005; 0.995 | Br | 3.0 | Stoichiometric | Undoped | NaN | 400.0 | 2.3 | False | 535.0 | 0 | Spin-coating | N2 | DMF; DMSO | 4; 1 | True | Chlorobenzene | NaN | Undoped | 100 | 60.0 | N2 | False | Unknown | NaN | NaN | Spiro-MeOTAD | 200.0 | Li-TFSI; TBP | 14.2 mM; 8 vol% | Spin-coating | N2 | Chlorobenzene | NaN | Au | 60.0 | Evaporation | MgF2 | False | False | Cover glass-QDs | 100.0 | AM 1.5 | 100.0 | NaN | NaN | NaN | NaN | NaN | 1.250000 |
| 42489 | 2019-07-12 | 0.102 | nip | False | False | 0.25 | SLG \| FTO | TiO2-c \| TiO2-mp | 50.0 \| 250.0 | TiCl4 | Spin-coating \| Spin-coating >> Hydrothermal | NaN | False | True | False | True | MA | 1.0 | Ag; Pb | 0.005; 0.995 | Br | 3.0 | Stoichiometric | Undoped | NaN | 400.0 | 2.3 | False | 535.0 | 0 | Spin-coating | N2 | DMF; DMSO | 4; 1 | True | Chlorobenzene | NaN | Undoped | 100 | 60.0 | N2 | False | Unknown | NaN | NaN | Spiro-MeOTAD | 200.0 | Li-TFSI; TBP | 14.2 mM; 8 vol% | Spin-coating | N2 | Chlorobenzene | NaN | Au | 60.0 | Evaporation | MgF2 | False | False | Cover glass-QDs | 100.0 | AM 1.5 | 100.0 | NaN | NaN | NaN | NaN | NaN | 2.410000 |
| 42490 | 2019-07-12 | 0.120 | nip | False | False | 0.25 | SLG \| FTO | TiO2-c \| TiO2-mp | 50.0 \| 250.0 | TiCl4 | Spin-coating \| Spin-coating >> Hydrothermal | NaN | False | True | False | True | MA | 1.0 | Ag; Pb | 0.1; 0.9 | Br | 3.0 | Stoichiometric | Undoped | NaN | 400.0 | 2.3 | False | 535.0 | 0 | Spin-coating | N2 | DMF; DMSO | 4; 1 | True | Chlorobenzene | NaN | Undoped | 100 | 60.0 | N2 | False | Unknown | NaN | NaN | Spiro-MeOTAD | 200.0 | Li-TFSI; TBP | 14.2 mM; 8 vol% | Spin-coating | N2 | Chlorobenzene | NaN | Au | 60.0 | Evaporation | MgF2 | False | False | Cover glass-QDs | 100.0 | AM 1.5 | 100.0 | NaN | NaN | NaN | NaN | NaN | 0.475000 |
| 42491 | 2019-07-12 | 0.074 | nip | False | False | 0.25 | SLG \| FTO | TiO2-c \| TiO2-mp | 50.0 \| 250.0 | TiCl4 | Spin-coating \| Spin-coating >> Hydrothermal | NaN | False | True | False | True | MA | 1.0 | Ag; Pb | 0.1; 0.9 | Br | 3.0 | Stoichiometric | Undoped | NaN | 400.0 | 2.3 | False | 535.0 | 0 | Spin-coating | N2 | DMF; DMSO | 4; 1 | True | Chlorobenzene | NaN | Undoped | 100 | 60.0 | N2 | False | Unknown | NaN | NaN | Spiro-MeOTAD | 200.0 | Li-TFSI; TBP | 14.2 mM; 8 vol% | Spin-coating | N2 | Chlorobenzene | NaN | Au | 60.0 | Evaporation | MgF2 | False | False | Cover glass-QDs | 100.0 | AM 1.5 | 100.0 | NaN | NaN | NaN | NaN | NaN | 0.891892 |
| 42492 | 2019-07-12 | 0.082 | nip | False | False | 0.25 | SLG \| FTO | TiO2-c \| TiO2-mp | 50.0 \| 250.0 | TiCl4 | Spin-coating \| Spin-coating >> Hydrothermal | NaN | False | True | False | True | MA | 1.0 | Ag; Pb | 0.1; 0.9 | Br | 3.0 | Stoichiometric | Undoped | NaN | 400.0 | 2.3 | False | 535.0 | 0 | Spin-coating | N2 | DMF; DMSO | 4; 1 | True | Chlorobenzene | NaN | Undoped | 100 | 60.0 | N2 | False | Unknown | NaN | NaN | Spiro-MeOTAD | 200.0 | Li-TFSI; TBP | 14.2 mM; 8 vol% | Spin-coating | N2 | Chlorobenzene | NaN | Au | 60.0 | Evaporation | MgF2 | False | False | Cover glass-QDs | 100.0 | AM 1.5 | 100.0 | NaN | NaN | NaN | NaN | NaN | 1.080000 |
| 42493 | 2019-07-12 | 0.083 | nip | False | False | 0.25 | SLG \| FTO | TiO2-c \| TiO2-mp | 50.0 \| 250.0 | TiCl4 | Spin-coating \| Spin-coating >> Hydrothermal | NaN | False | True | False | True | MA | 1.0 | Ag; Pb | 0.02; 0.98 | Br | 3.0 | Stoichiometric | Undoped | NaN | 400.0 | 2.3 | False | 535.0 | 0 | Spin-coating | N2 | DMF; DMSO | 4; 1 | True | Chlorobenzene | NaN | Undoped | 100 | 60.0 | N2 | False | Unknown | NaN | NaN | Spiro-MeOTAD | 200.0 | Li-TFSI; TBP | 14.2 mM; 8 vol% | Spin-coating | N2 | Chlorobenzene | NaN | Au | 60.0 | Evaporation | MgF2 | False | False | Cover glass-QDs | 100.0 | AM 1.5 | 100.0 | NaN | NaN | NaN | NaN | NaN | 1.610000 |
| 42494 | 2019-07-12 | 0.092 | nip | False | False | 0.25 | SLG \| FTO | TiO2-c \| TiO2-mp | 50.0 \| 250.0 | TiCl4 | Spin-coating \| Spin-coating >> Hydrothermal | NaN | False | True | False | True | MA | 1.0 | Ag; Pb | 0.02; 0.98 | Br | 3.0 | Stoichiometric | Undoped | NaN | 400.0 | 2.3 | False | 535.0 | 0 | Spin-coating | N2 | DMF; DMSO | 4; 1 | True | Chlorobenzene | NaN | Undoped | 100 | 60.0 | N2 | False | Unknown | NaN | NaN | Spiro-MeOTAD | 200.0 | Li-TFSI; TBP | 14.2 mM; 8 vol% | Spin-coating | N2 | Chlorobenzene | NaN | Au | 60.0 | Evaporation | MgF2 | False | False | Cover glass-QDs | 100.0 | AM 1.5 | 100.0 | NaN | NaN | NaN | NaN | NaN | 1.610000 |
| 42495 | 2019-07-12 | 0.093 | nip | False | False | 0.25 | SLG \| FTO | TiO2-c \| TiO2-mp | 50.0 \| 250.0 | TiCl4 | Spin-coating \| Spin-coating >> Hydrothermal | NaN | False | True | False | True | MA | 1.0 | Ag; Pb | 0.05; 0.95 | Br | 3.0 | Stoichiometric | Undoped | NaN | 400.0 | 2.3 | False | 535.0 | 0 | Spin-coating | N2 | DMF; DMSO | 4; 1 | True | Chlorobenzene | NaN | Undoped | 100 | 60.0 | N2 | False | Unknown | NaN | NaN | Spiro-MeOTAD | 200.0 | Li-TFSI; TBP | 14.2 mM; 8 vol% | Spin-coating | N2 | Chlorobenzene | NaN | Au | 60.0 | Evaporation | MgF2 | False | False | Cover glass-QDs | 100.0 | AM 1.5 | 100.0 | NaN | NaN | NaN | NaN | NaN | 2.000000 |
| 42496 | 2022-04-11 | 0.105 | Unknown | False | False | NaN | Unknown | Unknown | NaN | NaN | Unknown | NaN | False | False | False | False | NaN | NaN | Ag; Pb | 0.005; 0.995 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | False | NaN | 0 | Unknown | Unknown | Unknown | NaN | False | Unknown | NaN | NaN | Unknown | Unknown | Unknown | False | Unknown | NaN | NaN | Unknown | NaN | NaN | NaN | Unknown | Unknown | Unknown | NaN | Unknown | NaN | Unknown | Unknown | False | False | Unknown | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.000000 |
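
Several columns in the rows above hold multi-valued cells, for example the b-site ions `Ag; Pb` with coefficients `0.005; 0.995`. The following is a minimal sketch of pairing such cells, assuming that `;` separates components within a cell and that the ion and coefficient lists are given in the same order. The helper name `pair_ions` is hypothetical and not part of the report or the dataset tooling.

```python
def pair_ions(ions: str, coefficients: str) -> dict:
    """Pair a ';'-separated ion cell with its ';'-separated coefficient cell.

    Assumption: both cells list their entries in the same order.
    """
    names = [part.strip() for part in ions.split(";")]
    values = [float(part) for part in coefficients.split(";")]
    if len(names) != len(values):
        # Mismatched lengths point to an inconsistent record rather than
        # something this helper should silently repair.
        raise ValueError(
            f"{len(names)} ions but {len(values)} coefficients: "
            f"{ions!r} vs {coefficients!r}"
        )
    return dict(zip(names, values))


b_site = pair_ions("Ag; Pb", "0.005; 0.995")
a_site = pair_ions("Cs; FA; MA", "0.05; 0.79; 0.16")
```

A cell pair whose lengths disagree, such as c-ions `Br; I` recorded with the single coefficient `3`, raises `ValueError` here instead of being guessed at.
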
Most frequently occurring
| | Ref_publication_date | Cell_area_measured | Cell_architecture | Cell_flexible | Module | Module_area_total | Substrate_stack_sequence | ETL_stack_sequence | ETL_thickness | ETL_additives_compounds | ETL_deposition_procedure | ETL_surface_treatment_before_next_deposition_step | Perovskite_dimension_2D | Perovskite_dimension_3D | Perovskite_dimension_3D_with_2D_capping_layer | Perovskite_composition_perovskite_ABC3_structure | Perovskite_composition_a_ions | Perovskite_composition_a_ions_coefficients | Perovskite_composition_b_ions | Perovskite_composition_b_ions_coefficients | Perovskite_composition_c_ions | Perovskite_composition_c_ions_coefficients | Perovskite_composition_none_stoichiometry_components_in_excess | Perovskite_additives_compounds | Perovskite_additives_concentrations | Perovskite_band_gap_graded | Perovskite_deposition_number_of_deposition_steps | Perovskite_deposition_procedure | Perovskite_deposition_synthesis_atmosphere | Perovskite_deposition_solvents | Perovskite_deposition_solvents_mixing_ratios | Perovskite_deposition_quenching_induced_crystallisation | Perovskite_deposition_quenching_media | Perovskite_deposition_quenching_media_additives_compounds | Perovskite_deposition_thermal_annealing_temperature | Perovskite_deposition_thermal_annealing_time | Perovskite_deposition_thermal_annealing_atmosphere | Perovskite_deposition_solvent_annealing | Perovskite_deposition_solvent_annealing_solvent_atmosphere | Perovskite_deposition_after_treatment_of_formed_perovskite | Perovskite_surface_treatment_before_next_deposition_step | HTL_stack_sequence | HTL_thickness_list | HTL_additives_compounds | HTL_additives_concentrations | HTL_deposition_procedure | HTL_deposition_synthesis_atmosphere | HTL_deposition_solvents | HTL_deposition_solvents_mixing_ratios | Backcontact_stack_sequence | Backcontact_thickness_list | Backcontact_deposition_procedure | Add_lay_front_stack_sequence | Add_lay_back | Encapsulation | Encapsulation_stack_sequence | JV_light_intensity | JV_light_spectra | JV_scan_speed | JV_scan_delay_time | JV_scan_integration_time | JV_preconditioning_protocol | JV_preconditioning_time | JV_preconditioning_potential | JV_default_PCE | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 891 | 2019-01-18 | 0.09 | nip | False | False | NaN | SLG \| FTO | TiO2-c \| TiO2-mp | NaN | Unknown | Spin-coating \| Spin-coating | NaN | False | True | False | True | MA | 1 | Pb | 1 | I | 3 | NaN | NaN | NaN | False | 2 | Spin-coating >> Spin-coating | Air >> Air | DMF >> IPA | 1 >> 1 | False | Unknown | NaN | 100.0 >> 100.0 | 10.0 >> 10.0 | Unknown | False | Unknown | NaN | NaN | Spiro-MeOTAD | NaN | Li-TFSI; TBP | NaN | Spin-coating | Unknown | Unknown | NaN | Au | 80.0 | Sputtering | Unknown | False | False | Unknown | 100.0 | AM 1.5 | NaN | NaN | NaN | Cooling | NaN | NaN | NaN | 10 |
| 285 | 2017-03-03 | 0.09 | pin | False | False | NaN | SLG \| FTO | PCBM-60 \| BCP | 100.0 \| nan | Unknown | Spin-coating \| Spin-coating | NaN | False | True | False | True | MA | 1 | Pb | 1 | I | 3 | NaN | NaN | NaN | False | 1 | Spin-coating | N2 | DMSO; GBL | 3; 7 | True | Chlorobenzene | NaN | 70.0 | 30.0 | Unknown | False | Unknown | NaN | NaN | NiO-c | 70.0 | Cu | NaN | Spin-coating | Unknown | Unknown | NaN | Ag | 150.0 | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | 100.0 | NaN | NaN | NaN | NaN | NaN | NaN | 9 |
| 1198 | 2019-10-17 | 0.16 | nip | False | False | NaN | SLG \| ITO | SnO2-c | NaN | Unknown | Unknown | NaN | False | True | False | True | Cs; FA; MA | 0.05; 0.79; 0.16 | Pb | 1 | Br; I | 0.51; 2.49 | PbI2 | NaN | NaN | False | 1 | Spin-coating | Unknown | DMF; DMSO | 4; 1 | True | Chlorobenzene | NaN | 100.0 | 60.0 | Unknown | False | Unknown | NaN | NaN | Spiro-MeOTAD | NaN | NaN | NaN | Spin-coating | Unknown | Unknown | NaN | Au | NaN | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | 200.0 | NaN | NaN | NaN | NaN | NaN | NaN | 9 |
| 357 | 2017-07-14 | 0.10 | pin | False | False | NaN | SLG \| ITO | PCBM-60 \| BCP | NaN | Unknown | Spin-coating \| Spin-coating | NaN | False | True | False | True | MA | 1 | Pb | 1 | I | 3 | MA | HPA | NaN | False | 1 | Spin-coating | Unknown | DMF | 1 | False | Unknown | NaN | 100 | 5.0 | Unknown | False | Unknown | NaN | NaN | PEDOT:PSS | NaN | NaN | NaN | Spin-coating | Unknown | Unknown | NaN | Ag | 80.0 | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | 500.0 | NaN | NaN | Potential biasing | 2.0 | 1.2 | 14.50 | 8 |
| 358 | 2017-07-14 | 0.10 | pin | False | False | NaN | SLG \| ITO | PCBM-60 \| BCP | NaN | Unknown | Spin-coating \| Spin-coating | NaN | False | True | False | True | MA | 1 | Pb | 1 | I | 3 | NaN | NaN | NaN | False | 1 | Spin-coating | Unknown | DMSO; GBL | 3; 7 | True | Toluene | NaN | 100 | 15.0 | Unknown | False | Unknown | NaN | NaN | PEDOT:PSS | NaN | NaN | NaN | Spin-coating | Unknown | Unknown | NaN | Ag | 80.0 | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | 500.0 | NaN | NaN | Potential biasing | 2.0 | 1.2 | 10.50 | 8 |
| 838 | 2018-11-30 | 0.04 | nip | False | False | NaN | SLG \| FTO | TiO2-c | 70.0 | ZnCl2 | Dipp-coating | NaN | False | True | False | True | MA | 1 | Pb | 1 | I | 3 | NaN | NaN | NaN | False | 1 | Spin-coating | N2 | DMSO; GBL | 3; 7 | True | Toluene | NaN | 100.0 | 10.0 | Unknown | False | Unknown | NaN | NaN | Spiro-MeOTAD | NaN | NaN | NaN | Spin-coating | Unknown | Unknown | NaN | Au | 80.0 | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 8 |
| 804 | 2018-11-03 | 0.16 | nip | False | False | NaN | SLG \| FTO | TiO2-c \| TiO2-mp | NaN | Unknown \| Li-TFSI | Spray-pyrolys \| Spin-coating | NaN | False | True | False | True | MA | 1 | Pb | 1 | I | 3 | NaN | NaN | NaN | False | 1 | Spin-coating | N2 | DMSO | 1 | True | Chlorobenzene | NaN | 100.0 | 45.0 | Unknown | False | Unknown | NaN | NaN | PTAA | NaN | Li-TFSI; TBP | NaN | Spin-coating | Unknown | Unknown | NaN | Au | 80.0 | Evaporation | Unknown | False | False | Unknown | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 7 |
| 106 | 2015-11-25 | 0.10 | pin | False | False | NaN | SLG \| ITO | PCBM-60 | 50.0 | Unknown | Spin-coating | NaN | False | True | False | True | FA | 1 | Pb | 1 | Br; I | 3 | MAI | NaN | NaN | False | 2 | Spin-coating >> Spin-coating | Unknown | DMF >> IPA | 1 >> 1 | False | Unknown | NaN | 110.0 >> 110.0 | 1.0 >> 5.0 | Unknown | False | Unknown | NaN | NaN | PEDOT:PSS | 40.0 | NaN | NaN | Spin-coating | Unknown | Unknown | NaN | Al | 100.0 | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 6 |
| 393 | 2017-08-15 | 0.10 | nip | False | False | NaN | SLG \| FTO | TiO2-c | 50.0 | Unknown | Spin-coating | NaN | False | True | False | True | MA | 1 | Pb | 1 | I | 3 | NaN | NaN | NaN | False | 1 | Spin-coating | Air | DMF | 1 | True | He | NaN | 100.0 | 10.0 | Unknown | False | Unknown | NaN | NaN | Spiro-MeOTAD | NaN | NaN | NaN | Spin-coating | Unknown | Unknown | NaN | Au | NaN | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | NaN | NaN | NaN | NaN | NaN | NaN | 17.08 | 6 |
| 394 | 2017-08-15 | 0.10 | nip | False | False | NaN | SLG \| FTO | TiO2-c | 50.0 | Unknown | Spin-coating | NaN | False | True | False | True | MA | 1 | Pb | 1 | I | 3 | NaN | NaN | NaN | False | 1 | Spin-coating | Air | DMF | 1 | True | N2 | NaN | 100.0 | 10.0 | Unknown | False | Unknown | NaN | NaN | Spiro-MeOTAD | NaN | NaN | NaN | Spin-coating | Unknown | Unknown | NaN | Au | NaN | Evaporation | Unknown | False | False | Unknown | 100.0 | AM 1.5 | NaN | NaN | NaN | NaN | NaN | NaN | 17.98 | 6 |
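
A "# duplicates" column like the one above can be reproduced by counting identical rows. The sketch below is an illustration using only the Python standard library, not the report generator's own code. It assumes each record has already been read as a sequence of strings, so that missing values such as `NaN` compare equal as text. The function name `most_frequent_duplicates` and the toy records are hypothetical.

```python
from collections import Counter


def most_frequent_duplicates(rows, n=10):
    """Return up to n (row, count) pairs for rows that occur more than once.

    Rows are compared as tuples of their cell values, most frequent first.
    """
    counts = Counter(tuple(row) for row in rows)
    duplicated = [(row, count) for row, count in counts.items() if count > 1]
    # sorted() is stable, so ties keep their first-seen order.
    return sorted(duplicated, key=lambda item: item[1], reverse=True)[:n]


# Toy records with three columns (architecture, a-ion, b-ion); illustrative only.
records = [
    ("nip", "MA", "Pb"),
    ("pin", "MA", "Pb"),
    ("nip", "MA", "Pb"),
    ("nip", "FA", "Pb"),
]
top = most_frequent_duplicates(records)
```

Here `top` contains the single repeated record together with its count of 2.
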